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METHODS AND ORGANISMS FOR 
PRODUCTION OF B6 VITAMERS 

Background of the Invention 

Vitamin B6, also known as pyridoxine or pyridoxol (PN), or one of a 
number of closely related compounds, is an essential dietary nutrient for most, if not all, 
animals, while many micro-organisms (bacteria, fungi, algae, etc.) and plants are 
capable of synthesizing their own vitamin B6 or compound(s) related to vitamin B6. 
When an animal ingests PN or a related compound that has vitamin B6 activity, the 
1 0 compound is converted ultimately into pyridoxal phosphate (PLP) and/or pryidoxamine 
phosphate (PMP), which are the active forms of vitamin B6 in all living organisms. PLP 
acts as a cofactor for many important or essential enzymes in all living organisms, 
including transaminases, racemases, and decarboxylases. PLP and PMP are easily 
interconverted byiubiquitous transaminases. 
1 5 Vitamin B6 is of commercial importance in vitamin pills, pharmaceutical 

applications, and as an animal feed additive that enhances growth or desirable growth 
characteristics in farm and domestic animals. The currently used commercial process 
for producing vitamin B6 is a synthetic chemical process. However, a fermentation 
process using a microorganism (see US Patent application No. 09/667,569, filed 
20 September 21 , 2000, hereby incorporated in its entirety by reference) or a biosynthetic 
process using a plant species can be more cost effective in the long run, and may be 
environmentally more attractive. 

The biosynthetic pathway for PLP in E. coli has been elucidated 
(reviewed in Mittengruber, G., (2001) J. Mol. Microbiol. Biotechnol. 3(1): 1-20; Cane, 
25 D.E., et al. (2000) J. Am. Chem. Soc. 122: 4213-4214; Man, T-K, et al., (1996) J. 

Bacteriol. 1 78: 2445-2449). Enzymes encoded by the genes epd, pdxB, pdxF, saApdxA 
lead to synthesis of the precursor l-hydroxy-3-amino acetone phosphate from erythrose- 
4-phosphate and glutamate. The enzyme encoded by dxs leads to the precursor 5'- 
deoxyxylulose phosphate from glycolytic intermediates. The enzyme encoded by pdxJ 
30 then catalyzes the chemical coupling of the two precursors to give pyridoxol phosphate 
(also called pyridoxine phosphate or PNP). PNP is then oxidized to the active form, 
PLP, by the enzyme encoded by pdxH. This biosynthetic pathway to PLP in E. coli, as 
well as closely related pathways, are referred to herein as the Type A Pathway. Partially 
characterized mutants of E. coli have been described that produce about three- to seven- 
35 fold more vitamin B 6 -related compounds than the parent strain (Dempsey and Arcement 
(1971) J. Bacteriol. 107(2): 580-582). Partially characterized mutants of B. subtilis have 
been reported that produce 1 - 5 mg/1 vitamin B 6 , but it was not stated what level the 



ATTORNEY DOCKET NO: BGI-152-1 



parent strain produced (Pflug, W., and Lingens, F., (1 978) Hoppe-Seyler's Z. Physiol. 
Chem. 359: 559-570). Notably, these organisms were not recombinantly produced. 

A second biosynthetic pathway for vitamin B6, referred to herein as the 
Type B pathway, may exist in some organisms other than E. coli (Mittengruber, G., 
5 (2001) J. Mol. Microbiol Biotechnol. 3(l):l-20). In particular, some fungi (for example 
from the genera Cercospora, Neurospora, Aspergillus and Saccharomyces), some 
bacteria (for example B. subtilis and Staphylococcus aureus), and all plants for which 
data exists do not contain any genes that are highly homologous to E. colipdxA and 
pdxJ. Instead, these organisms contain genes that are homologous to Cercospora genes 
10 named SOR (or SNZ) and SNO. In Saccharomyces, these homologs are called PDX1 and 
PDX2, respectively, and inB. subtilis, these homologs are named yaaD and yaaE, 
. respectively. In B. subtilis, there have been no reports as to whether yaaD or yaaE are 
actually involved in PLP biosynthesis. Protein or DNA sequence homology alone is not 
. sufficient to establish biological function. For example, B. subtilis contains a gene, 
15 [ yhaF, that encodes a protein that is significantly homologous to E. colipdxF. However, 
when yhaF is mutated, the resulting mutant B. subtilis strain is a serine auxotroph, but 
not a PL auxotroph (see Example 3, below). Thus, the identification of a gene or genes 
involved in PLP biosynthesis in any given organism can not be done using sequence 
homology alone. 

20 Results from 13 C and ,S N labeling studies suggest that the precursors that 

provide the carbon and nitrogen atoms in PL and related compounds are different in E. 
coli and Saccharomyces cerevisiae (Gupta, R, et al. (2001) J. Am Chem. Soc. 123: 
1 1353-11359; Tayuza, K., et al. (1995) Biochim. Biophys. Acta 1244 : 1 13-116.) 
However, the identity of the precursors for PL and related compounds in S. cerevisiae is 
25 not yet known. Since most micro-organisms for which the entire genome sequence is 
known (for example E. coli, S. cerevisiae and B. subtilis) have either pdxA smdpdxJ 
homologs or SOR and SNO homologs, but not both, it appears that most organisms that 
are capable of synthesizing PLP have either the well characterized Type A Pathway (for 
example E. coli, Salmonella typhimurium, and many other genera), or a distinctly 
30 different and incompletely characterized pathway, e.g., the Type B Pathway. 
Specifically, members of the genera Cercospora, Neurospora, Aspergillus, 
Saccharomyces, Bacillus, Arabidopsis, and many other genera, appear to have a Type B 
pathway, and are lacking genes involved in the Type A Pathway. The intermediate 
compounds in the Type B Pathway have not yet been elucidated, although the final 
35 product must be PLP (as for the Type A Pathway) or PMP, since these are the active 
forms of vitamin B6 in all known organisms. 
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Summary of the Invention 

The present invention is based, at least in part, on the discovery of key 
enzyme-encoding genes of the B6 vitamer biosynthetic pathways in Bacillus subtilis. In 
particular, the invention is based, at least in part, on the discovery that the yaaD and 

5 yaaE genes of B. subtilis are required for B6 vitamer synthesis. Deletion of a portion of 
iheyaaD and yaaE genes (which are adjacent in an operon, e.g., the yaaDE operon) 
leads to PL auxotrophy. Overexpression of the yaaDE operon or the deregulation of the 
expression of the yaaD or yaaE genes leads to significantly increased production of B6 
vitamers in, e.g., B. subtilis strains. The B. subtilis yaaDE operon is required for 

10 pyridoxal phosphate (PLP) biosynthesis, an active form of vitamin B6 in all living 

organisms. The present invention describes that the expression of the B. subtilis yaaDE 
operon is a rate limiting step for production of compounds related to vitamin B6 in a 
wild type strain. 

Accordingly, the present invention features methods of producing B6 
15 vitamers, including, but not limited to, pyridoxine (or pyridoxol (PN)), pyridoxal (PL), 
pyridoxamine (PM), or the 5' phosphorylated derivatives of any of the three 
aforementioned compounds (PNP, PLP, and PMP), using organisms in which the B6 
vitamer pathway has been manipulated such that B6 vitamers are produced. Such 
methods include culturing a microorganism that overexpresses at least one B6 vitamer 
20 biosynthetic enzyme (e.g., at least one of the yaaD ox yaaE gene products), under 
conditions such that the B6 vitamer is produced. The production methods of the 
present invention further can include recovering the B6 vitamer. 

The instant invention also features genetically modified organisms (i.e., 
organisms that contain one or more modifications or mutations in the genome) that are 
25 capable of producing significantly more of a B6 vitamer than an unmodified parent 
organism. In particular, this invention features micro-organisms (including, for 
example, but not limited to, bacteria, yeasts, fungi, and algae) or macro-organisms such 
as plants that, when genetically modified, produce an increased amount, e.g., at least 
about 10-fold more of a B6 vitamer, than the unmodified parent organism. Specific 
30 examples are given herein in which Bacillus subtilis and Escherichia coli strains have 
been genetically modified such that they produce significant amounts of a B6 vitamer. 
Accordingly, the present invention features orgamsms that have been genetically 
modified to increase the activity of one or more enzymes that catalyze(s) a step in the 
biosynthesis of a B6 vitamer, such that B6 vitamer production from said modified 
35 organism is increased compared to B6 production in an unmodified parent organism. 

Yet another aspect of Ihe invention features recombinant microorganisms 
which overexpress at least one Bacillus (e.g., B. subtilis) B6 vitamer biosynthetic 
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enzyme (e.g., at least one of the yaaD oxyaaE gene products) are described. In one 
embodiment, the recombinant microorganism is Gram positive (e.g., microorganisms 
belonging to the genus Bacillus, Cornyebacterium, Lactobacillus, Lactococci or 
Streptomyces). In another embodiment, the recombinant microorganism is Gram 
negative. Particularly preferred is a Bacillus recombinant microorganism (e.g., Bacillus 
licheniformis, Bacillus amyloliquefaciens, Bacillus subtilis, Bacillus pumilus, Bacillus 

halodwans, and the like). 

Recombinant vectors that contain genes encoding Bacillus B6 vitamer 
biosynthetic enzymes, e.g., yaaD or yaaE genes, are also described. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 



Brief Description of the Drawings 

j 5 Figure 1 depicts the chemical structures of vitamin B6 and related 

compounds. 

Figure 2 depicts the biosynthetic pathway for PLP in K coli. 
Figure 3 depicts the standard curves generated by Saccharomyces 
unarum strain ATCC 9080 after feeding serial dilutions of PN, PL, and PM (as 
20 described in Example 1). 

Figure 4 is a schematic representation of the plasmid pDXIF. 
Figure 5 is a schematic representation of the plasmid pDXl IF. 
Figure 6 is a schematic representation of the plasmid pDX14R. 
Figure 7 is a schematic representation of the plasmid pDX17R. 

25 

Detailed Description of the Invention 

The present invention is based, at least in part, on the identification of 
Bacillus (e.g., B. subtilis) genes that encode essential enzymes of the B6 vitamer 
biosynthetic pathway. In particular, the present invention features methods based on 

30 manipulation of the B6 vitamer biosynthetic pathway in a microorganism such that 
certain desirable compounds are produced. 

In particular, the invention is based, at least in part, on the discovery that 
the yaaD and yaaE genes of B. subtilis are required for B6 vitamer synthesis, including, 
but not limited to, pyridoxine (or pyridoxol (PN)), pyridoxal (PL), pyridoxamine (PM), 

35 or the 5 ' phosphorylated derivatives of any of the Ihree aforementioned compounds 
(PNP, PLP, and PMP). The yaaD and yaaE genes are adjacent on an operon, e.g., the 
yaaDE operon. The yaaD and yaaE genes encode the YaaD and Yaa E proteins, 
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respectively. Overexpression of the yaaDE operon with a strong constitutive promoter 
or the deregulation of the expression of the yaaD otyaaE gene(s) leads to significantly 
increased production of B6 vitamers. These quantities are significantly higher relative 
to the associated parent strains than those reported in previous studies, which have 
5 employed mutant K coli strains (Dempsey and Arcement (1971) J. Bacteriol. 107 (2): 
580-582), or mutant B. subtilis strains (Pflug, W., and Lingens, F., (1978) Hoppe- 
Seyler's Z. Physiol. Chem. 359: 559-570). 

Accordingly, the present invention features organisms that have been 
genetically modified to increase the activity of one or more enzymes that catalyze a step 
10 in the biosynthesis of a B6 vitamer, such that B6 vitamer production from the modified 
organism is increased compared to B6 production in an unmodified parent organism. In 
one embodiment, B6 vitamer production is at least ten-fold higher than from the 
gj unmodified parent organism. In another embodiment, the organism is genetically 

5 modified to overexpress one or more genes that encodes an enzyme that catalyzes a step 

f\ 15 in the biosynthesis of a B6 vitamer, e.g., j«ia£> or yaaE. The organism may be, for 
q example, B. subtilis. 

tQ The present invention also features methods of producing a B6 vitamer 

* comprising culturing a microorganism that has been genetically modified to overexpress 

O one or more genes that encodes an enzyme that catalyzes a step in the biosynthesis of a 

W 20 B6 vitamer, such that B6 vitamer production from said modified organism is increased 
% compared to B6 production in an unmodified parent organism, under conditions such 

b that the B6 vitamer is produced. The B6 vitamer may then be subsequently recovered. 

ry Overproduction of the rate limiting enzyme for B6 vitamer production in any organism 

that is capable of producing B6 vitamers will lead to overproduction of B6 vitamers. 
25 The terms "B6 vitamer" or "B6 vitamers," as used herein, shall refer to 

any compound or mixture of compounds that has any biological activity in any 
biological assay for vitamin B6. B6 vitamers include, but are not limited to, pyridoxine 
(also called pyridoxol or PN), pyridoxal (PL), pyridoxamine (PM), the 5' 
phosphorylated derivatives of any of the three aforementioned compounds (PNP, PLP, 
30 and PMP), and any derivative or related compound that can be converted to the active 
forms (PLP or PMP) in a test organism. Thus, for example, the acetate esters or other 
esters of any of the available hydroxyl groups of any of the aforementioned six 
compounds, and which are likely to be hydrolyzed by specific or non-specific esterases, 
are included in B 6 vitamers. Also, various salts, such as hydrochloride salts, of any of 
35 the aforementioned compounds are included in B6 vitamers. 

The term "B6 vitamer biosynthetic pathway" includes the biosynthetic 
pathway involving B6 vitamer biosynthetic enzymes (e.g., polypeptides encoded by 
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biosyntbetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 
intermediates or products), cofactors and the like utilized in the formation or synthesis 
of B6 vitamers. The term "B6 vitamer biosynthetic pathway" includes the biosyntbetic 
pathway leading to the synthesis of B6 vitamers in a microorganism (e.g. , in vivo) as 
5 well as the biosynthetic pathway leading to the synthesis of B6 vitamers in vitro. 

A "biological assay for a B6 vitamer" includes, for example, any assay 
• that is capable of quantifying B6 vitamer activity by measuring growth of an organism 
that requires the feeding of a B6 vitamer (i.e., a compound that the fed organism can 
convert into PLP or PMP) for growth. Samples to be assayed are diluted serially in an 
10 appropriate medium and fed to the appropriate organism. Standard curves are generated 
by serially diluting known amounts of PL, PN, or PM, and feeding these dilutions to the 
test organism. By comparing dilutions of the unknown samples to the standard curves, 
total B6 vitamer activity can be determined, for example as PL equivalents if PL was 
used to generate the standard curve. 



15 



Various aspects of the invention are described in further detail in the 
following subsections. 



/ Genes Encoding Various B6 Vitamer Bio synthetic Enzymes 
20 In one embodiment, the present invention features targeting or modifying 

various biosynthetic genes or enzymes of the B6 vitamer biosynthetic pathway. In 
particular, the invention features modifying various enzymatic activities associated with 
said pathways by modifying or altering the genes encoding said biosynthetic enzymes. 

The term "gene", as used herein, includes a nucleic acid molecule (e.g. , a 
25 DNA molecule or segment thereof) that, in an organism, can be separated from another 
gene or other genes, by intergenic DNA (/.*., intervening or spacer DNA which 
naturally flanks the gene and/or separates genes in the chromosomal DNA of the 
organism). Alternatively, a gene may slightly overlap another gene (e.g., the 3' end of a 
first gene overlapping the 5' end of a second gene), said overlapping genes separated 
30 from other genes by intergenic DNA. A gene may direct synthesis of an enzyme or 

other protein molecule (e.g., may comprise coding sequences, for example, a contiguous 
open reading frame (ORF) which encodes a protein) or may itself be functional in the 
organism. A gene in an organism, may be clustered in an operon, as defined herein, said 
operon being separated from other genes and/or operons by the intergenic DNA. An 
35 "isolated gene", as used herein, includes a gene which is essentially free of sequences 
which naturally flank the gene in the chromosomal DNA of the organism from which 
the gene is derived (i.e., is free of adjacent coding sequences which encode a second or 
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distinct protein, adjacent structural sequences or the like) and optionally includes 5 1 and 

3' regulatory sequences, for example promoter sequences and/or terminator sequences. 

In one embodiment, an isolated gene includes predominantly coding sequences for a 

protein (e.g., sequences which encode Bacillus proteins). In another embodiment, an 

isolated gene includes coding sequences for a protein (e.g., for a Bacillus protein) and 

adjacent 5' and/or 3' regulatory sequences from the chromosomal DNA of the organism 

from which the gene is derived (e.g., adjacent 5' and/or 3' Bacillus regulatory 

sequences). Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb, 

0.5 kb, 0.2 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences that naturally flank 

the gene in the chromosomal DNA of the organism from which the gene is derived. 

The term "operon" includes at least two adjacent genes or ORFs, 

optionally overlapping in sequence at either the 5' or 3' end of at least one gene or ORF. 

The term "operon" includes a coordinated unit of gene expression that contains a 

promoter and possibly a regulatory element associated with one or more adjacent genes 

or ORFs (e.g., structural genes encoding enzymes, for example, biosynthetic en2ymes). 

Expression of the genes (e.g:, structural genes) can be coordinate^ regulated, for 

example, by regulatory proteins binding to the regulatory element or by anti-termination 

of transcription. The genes of an operon (e.g. , structural genes) can be transcribed to 

give a single mRNA that encodes all of the proteins. 

A "gene having a mutation" or "mutant gene" as used herein, includes a 

gene having a nucleotide sequence which includes at least one alteration (e.g., 
substitution, insertion, deletion) such that the polypeptide or protein encoded by said 
mutant exhibits an activity that differs from the polypeptide or protein encoded by the 
wild-type nucleic acid molecule or gene. In one embodiment, a gene having a mutation 
; or mutant gene encodes a polypeptide or protein having an increased activity as 
compared to the polypeptide or protein encoded by the wild-type gene, for example, 
when assayed under similar conditions (e.g., assayed in microorganisms cultured at the 
same temperature). As used herein, an "increased activity" or "increased enzymatic 
activity" is one that is at least 5% greater than that of the polypeptide or protein encoded 
0 by the wild-type nucleic acid molecule or gene, preferably at least 5-10% greater, more 
preferably at least 10-25% greater and even more preferably at least 25-50%, 50-75% or 
75-100% greater than that of the polypeptide or protein encoded by the wild-type 
nucleic acid molecule or gene. Ranges intermediate to the above-recited values, e.g., 
75-85%, 85-90%, 90-95%, are also intended to be encompassed by the present 
35 invention. As used herein, an "increased activity" or "increased enzymatic activity" can 
alsolnclude an activity that is at least 1.25-fold greater than the activity of the 
polypeptide or protein encoded by the wild-type gene, preferably at least 1.5-fold 
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greater, more preferably at least 2-fold greater and even more preferably at least 3-fold, 
4-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100-fold or greater than the activity of the 
polypeptide or protein encoded by the wild-type gene. 

In another embodiment, a gene having a mutation or mutant gene 
5 encodes a polypeptide or protein having a reduced activity as compared to the 

polypeptide or protein encoded by the wild-type gene, for example, when assayed under 
similar conditions (e.g., assayed in microorganisms cultured at the same temperature). 
A mutant gene also can encode no polypeptide or have a reduced level of production of 
the wild-type polypeptide. As used herein, a "reduced activity" or "reduced enzymatic 
10 activity" is one that is at least 5% less than that of the polypeptide or protein encoded by 
the wild-type nucleic acid molecule or gene,' preferably at least 5-10% less, more 
preferably at least 10-25% less and even more preferably at least 25-50%, 50-75% or 
75-100% less than that of the polypeptide or protein encoded by the wild-type nucleic 
acid molecule or gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 
1 5 85-90%, 90-95%, are also intended to be encompassed by the present invention. As 
used herein, a "reduced activity" or "reduced enzymatic activity" can also include an 
activity that has been deleted or "knocked out" (e.g, approximately 100% less activity 
than that of the polypeptide or protein encoded by the wild-type nucleic acid molecule 
or gene). 

20 Activity can be determined according to any well accepted assay for 

measuring activity of a particular protein of interest. Activity can be measured or 
assayed directly, for example, measuring an enzymatic or biological activity of a protein 
isolated or purified from a cell or microorganism. Alternatively, an activity can be 
measured or assayed within a cell or mocroorganism or in an extracellular medium. For 
25 example, assaying for a mutant gene (i.e., said mutant encoding a reduced enzymatic 
activity) can be accomplished by expressing the mutated gene in a microorganism, for 
example, a mutant microorganism in which the enzyme is temperature-sensitive, and 
assaying the mutant gene for the ability to complement a temperature sensitive (Ts) 
mutant for enzymatic activity. A mutant gene that encodes an "increased enzymatic 
30 activity" can be one that complements the Ts mutant more effectively than, for example, 
a corresponding wild-type gene. A mutant gene that encodes a "reduced enzymatic 
activity" is one that complements the Ts mutant less effectively than, for example, a 

corresponding wild-type gene. 

It will be appreciated by the skilled artisan that even a single substitution 
35 in a nucleic acid or gene sequence {e.g., a base substitution that encodes an amino acid 
change in the corresponding amino acid sequence) can dramatically affect the activity of 
an encoded polypeptide or protein as compared to the corresponding wild-type 
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polypeptide or protein. A mutant gene (e.g., encoding a mutant polypeptide or protein), 
as defined herein, is readily distinguishable from a nucleic acid or gene encoding a 
protein homologue in that a mutant gene encodes a protein or polypeptide having an 
altered activity, optionally observable as a different or distinct phenotype in a 
5 microorganism expressing said mutant gene or producing said mutant protein or 
polypeptide (i.e. , a mutant microorganism) as compared to a corresponding 
microorganism expressing the wild-type gene. By contrast, a protein homologue has an 
identical or substantially similar activity, optionally phenotypically indiscernible when 
produced in a microorganism, as compared to a corresponding microorganism 

1 0 expressing the wild-type gene. Accordingly it is not, for example, the degree of 

sequence identity between nucleic acid molecules, genes, protein or polypeptides that 
serves to distinguish between homologues and mutants, rather it is the activity of the 
encoded protein or polypeptide that distinguishes between homologues and mutants: 
homologues having, for example, low (e.g., 30-50% sequence identity) sequence 

1 5 identity yet having substantially equivalent functional activities, and mutants, for 
example sharing 99% sequence identity yet having dramatically different or altered 
functional activities. 

In a preferred embodiment, the genes of the present invention are derived 
from Bacillus. The term "derived from Bacillus" or "Bacillus-derived" includes a gene 

20 which is naturally found in microorganisms of the genus Bacillus. In another preferred 
embodiment, the genes of the present invention are derived from a microorganism 
selected from the group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus 
lentus, Bacillus firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus 
cereus, Bacillus circulans, Bacillus coagulans, Bacillus licheniformis, Bacillus 

25 megaterium, Bacillus pumilus, Bacillus thuringiensis, Bacillus halodurans, and other 
Group 1 Bacillus species, for example, as characterized by 16S rRNA type. In another 
preferred embodiment, the gene is derived from Bacillus brevis or Bacillus 
stearothermophilus. In another preferred embodiment, the genes of the present 
invention are derived from a microorganism selected from the group consisting of 

30 Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus subtilis, and Bacillus 
pumilus. In a particularly preferred embodiment, the gene is derived from Bacillus 
subtilis (e.g., is Bacillus subtilis-derived). The term "derived from Bacillus subtilis" or 
"Bacillus subtilis-derived" includes a gene which is naturally found in the 
microorganism Bacillus subtilis. Included within the scope of the present invention are 

35 Bacillus-derived genes (e.g., B. subtilis-derived genes), for example, Bacillus or JJ. 
subtilis yaaD or yaaE genes. 
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II Recombinant Nucleic Acid Mo lecules and Vectors 
The present invention further features recombinant nucleic acid 
molecules (e.g., recombinant DNA molecules) that include genes described herein {e.g., 
isolated genes), preferably Bacillus genes, more preferably Bacillus subtilis genes, even 
5 more preferably Bacillus subtilis B6 vitamer biosynthetic genes. The term "recombinant 
nucleic acid molecule" includes an isolated nucleic acid molecule (e.g., a DNA 
molecule) that has been altered, modified or engineered such that it differs in nucleotide 
sequence from the native or natural nucleic acid molecule from which the recombinant 
nucleic acid molecule was derived (e.g., by addition, deletion or substitution of one or 
10 more nucleotides). Preferably, a recombinant nucleic acid molecule (e.g., a recombinant 
DNA molecule) includes an isolated gene of the present invention operably linked to 
regulatory sequences. The phrase "operably linked to regulatory sequence^)" means 
that the nucleotide sequence of the gene of interest is linked to the regulatory 
sequence(s) in a manner which allows for expression (e.g. , enhanced, increased, 
1 5 constitutive, basal, attenuated, decreased or repressed expression) of the gene, preferably 
expression of a gene product encoded by the gene (e.g., when the recombinant nucleic 
acid molecule is included in a recombinant vector, as defined herein, and is introduced 
into a microorganism). A "recombinant organism" is any organism that contains a 
recombinant nucleic acid molecule. 
20 The term "regulatory sequence" includes nucleic acid sequences that 

affect (e.g, modulate or regulate) expression of other nucleic acid sequences (i.e., 
genes). In one embodiment, a regulatory sequence is included in a recombinant nucleic 
acid molecule in a similar or identical position and/or orientation relative to a particular 
gene of interest as is observed for the regulatory sequence and gene of interest as it 
25 appears in nature, e.g., in a native position and/or orientation. For example, a gene of 
interest can be included in a recombinant nucleic acid molecule operably linked to a 
regulatory sequence which accompanies or is adjacent to the gene of interest in the 
natural organism (e.g., operably linked to "native" regulatory sequences (e.g, to the 
"native" promoter). Alternatively, a gene of interest can be included in a recombinant 
30 nucleic acid molecule operably linked to a regulatory sequence which accompanies or is 
adjacent to another (e.g., a different) gene in the natural organism. Alternatively, a gene 
of interest can be included in a recombinant nucleic acid molecule operably linked to a 
regulatory sequence from another organism. For example, regulatory sequences from 
other microbes (e.g., other bacterial regulatory sequences, bacteriophage regulatory 
35 sequences and the like) can be operably linked to a particular gene of interest. 

In one embodiment, a regulatory sequence is a non-native or non- 
naturally-occurring sequence (e.g., a sequence which has been modified, mutated, 
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substituted, derivatized, deleted including sequences which are chemically synthesized). 
Preferred regulatory sequences include promoters, enhancers, termination signals, anti- 
termination signals, ribosome binding sites and other expression control elements {e.g., 
sequences to which repressors or inducers bind and/or binding sites for transcriptional 
5 and/or translation^ regulatory proteins, for example, in the transcribed mRNA). Such 
regulatory sequences are described, for example, in Sambrook, J., Fritsh, E. F., and 
Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 
Regulatory sequences include those which direct constitutive expression of a nucleotide 
10 sequence in a microorganism (e.g., constitutive promoters and strong constitutive 
promoters), those which direct inducible expression of a nucleotide sequence in a 
microorganism (e.g., inducible promoters, for example, xylose inducible promoters) and 
those which attenuate or repress expression of a nucleotide sequence in a microorganism 
(e.g. , attenuation signals or repressor sequences). It is also within the scope of the 
15 present invention to regulate expression of a gene of interest by removing or deleting 
regulatory sequences. For example, sequences involved in the negative regulation of 
transcription can be removed such that expression of a gene of interest is enhanced. 

In one embodiment, a recombinant nucleic acid molecule of the present 
invention includes a nucleic acid sequence or gene that encodes at least one bacterial 
20 gene product (e.g., a B6 vitamer biosynthetic enzyme, e.g., the gene product of yaaD 
and/or yaaE) operably linked to a promoter or promoter sequence. Preferred promoters 
of the present invention include Bacillus promoters and/or bacteriophage promoters 
(e.g., bacteriophage which infect Bacillus). In one embodiment, a promoter is a Bacillus 
promoter, preferably a strong Bacillus promoter (e.g., a promoter associated with a 
25 biochemical housekeeping gene in Bacillus or a promoter associated with a glycolytic 
pathway gene in Bacillus). In another embodiment, a promoter is a bacteriophage 
promoter. In a preferred embodiment, the promoter is from the bacteriophage SP01 . In 
a particularly preferred embodiment, a promoter is selected from the group consisting of 
Pis, P26, or Pveg having for example, the following respective sequences: 
30 GCTATTGACGACAGCTATGGTTCACTGTCCACCAACCAAAACTGTGCTCAGT 
ACCGCCAATATTTCTCCCTTGAGGGGTACAAAGAGGTGTCCCTAGAAGAGAT 
CCACGCrGTGTAAAAATTTTACAAAAAGGTATTGACTTTCCCTACAGGGTGT 

GTAATAATTTAATTACAGGCGGGGGCAACCCCGCCTGT (SEQ ID NO:9), 
GCCTACCTAGCTTCCAAGAAAGATATCCTAACAGCACAAGAGCGGAAAGAT 

35 gttttgttctacatccagaacaacctctg<:taaaattcctgaaaaattttgc 

AAAAAGTTGTTGACTTTATCTACAAGGTGTGGTATAATAATCTTAACAACAG 
CAGGACGC (SEQ ID NO:10); and 
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GAGGAATCATAGAATTTTGTCAAAATAATTlTATTGACAACGTCnTATTAAC 
GTTGATATAATTTAAATTTTATTTGACAAAAATGGGCTCGTGTTGTACAATA • 
AATGTAGTGAGGTGGATGCAATG (SEQ ID NO:l 1). Additional preferred 
promoters include te/(the translational elongation factor (TEF) promoter) and pyc (the 
5 pyruvate carboxylase (PYC) promoter), which promote high level expression in Bacillus 
(e.g., Bacillus subtilis). Additional preferred promoters, for example, for use in Gram 
positive microorganisms include, but are not limited to, amy and SP02 promoters. 
Additional preferred promoters, for example, for use in Gram negative microorganisms 
include, but are not limited to, tac, trp, tet, trp-tet, Ipp, lac, Ipp-lac, lacIQ, T7, T5, T3, 

1 0 gal, trc, ara, SP6, k-PR or k-PL. 

In another embodiment, a recombinant nucleic acid molecule of the 
present invention includes a terminator sequence or terminator sequences (e.g., 
transcription terminator sequences). The term "terminator sequences" includes 
regulatory sequences that serve to terminate transcription of mRNA. Terminator 
1 5 sequences (or tandem transcription terminators) can further serve to stabilize mRNA 
(e.g. , by adding structure to mRNA), for example, against nucleases. 

In yet another embodiment, a recombinant nucleic acid molecule of the 
present invention includes sequences which allow for detection of the vector containing 
said sequences (i.e., detectable and/or selectable markers), for example, genes that 
20 encode antibiotic resistance or sequences that overcome auxotrophic mutations, for 
example, trpC, fluorescent markers, drug markers, and/or colorimetric markers (e.g., 
facZ/p-galactosidase). In yet another embodiment, a recombinant nucleic acid molecule 
of the present invention includes an artificial ribosome binding site (RBS) or a sequence 
that becomes transcribed into an artificial RBS. The term "artificial ribosome binding 
25 site (RBS)" includes a site within an mRNA molecule (e.g. , coded within DNA) to 
which a ribosome binds (e.g., to initiate translation) which differs from a native RBS 
(e.g. , a RBS found in a naturally-occurring gene) by at least one nucleotide. Preferred 
artificial RBSs include about 5-6, 7-8, 9-10, 1 1-12, 13-14, 15-16, 17-18, 19-20, 21-22, 
23-24, 25-26, 27-28, 29-30 or more nucleotides of which about 1-2, 3-4, 5-6, 7-8, 9-10, 
30 11-12, 13-15 or more differ from the native RBS (e.g., the native RBS of a gene of 
interest, for example, Ihe native yaaD RBS 

GAAATCATATAACTATACCTTGATTAGGGGGACCAAGAAATG 
(SEQ ID NO:12) or the native yaaE RBS 

CAAGAACGCGGCTGGTAAGAACATAGGAGCGCTGCTGACATG (SEQ ID 
35 NO:13)). 

Preferably, nucleotides that differ are substituted such that they are 
identical to one or more nucleotides of an ideal RBS when optimally aligned for 
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comparisons. Artificial RBSs can be used to replace the naturaUyHKcurring or native 
KBSs associated with a particular gene. Artificial RBSs preferably increase translation 
of a particular gene. Preferred artificial RBSs (e.g., RBSs for increasing the translation 
of yaaE, for example, of B. subtilis) are set forth in Table 1, below. 

Table 1: Preferred Ribosome Binding Sites 



10 



10 20 

_J L 



SEQ ID NO: 



Native_yaaD - - -GAAAT CATATAACTATACCTTGATTAGGGGGACC-AAGAAATG 12 

Native_yaaE CAAGAACGCGGCTGGTAAGAACAT AGGAGCGCTGCTGACATG 13 

IDEAL RBS TCTAGAAAGG AGGTG A-- 14 

XQ S1 ~ TCTAGAAGG AGGAG AAAACATG 15 

ffl 15 RBS2 TCTAGAGG AGGAG AAAACATG 16 

Q RBS101 TAAGAACAA AGGAGGAGAGCTGACATG 17 

y RBS103 TAAGAAGAA AGGAGGTGAGCTGACATG 18 

~3 RBS102 TAAGAACAG AGGAGGAGAGCTGACATG 19 

SI 

0 20 The present invention further features vectors {e.g., recombinant vectors) 

1 that include nucleic acid molecules (e.g., genes or recombinant nucleic acid molecules 
comprising said genes) as described herein. The term "recombinant vector" includes a 

y vector (e.g., plasmid, phage, phasmid, virus, cosmid or other purified nucleic acid 

S vector) that has been altered, modified or engineered such that it contains greater, fewer 

pj 25 or different nucleic acid sequences than those included in the native or natural nucleic 
ftj acid molecule from which the recombinant vector was derived. Preferably, the 

recombinant vector includes a biosynythetic enzyme-encoding gene or recombinant 
nucleic acid molecule including said gene, operably linked to regulatory sequences, for 
example, promoter sequences, terminator sequences and/or artificial ribosome binding 
30 sites (RBSs), as defined herein. In another embodiment, a recombinant vector of the 
present invention includes sequences that enhance replication in bacteria (e.g., 
replication-enhancing sequences). In one embodiment, replication-enhancing sequences 
are derived from E. coli. In another embodiment, replication-enhancing sequences are 
derived from pBR322. In another embodiment, replication-enhancing sequences are 

35 derived from pSCl 01. 

In yet another embodiment, a recombinant vector of the present invention 
includes antibiotic resistance sequences. The term "antibiotic resistance sequences" 
includes sequences which promote or confer resistance to antibiotics on the host 
organism (e.g. , Bacillus). In one embodiment, the antibiotic resistance sequences are 

40 selected from the group consisting of cat (chloramphenicol resistance) sequences, tet 
(tetracycline resistance) sequences, erm (erythromycin resistance) sequences, neo 
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(neomycin resistance) sequences, kan (kanamycin resistance) and spec (spectmomycin 
resistance) sequences. Recombinant vectors of the present invention can further include 
homologous recombination sequences (e.g. , sequences designed to allow recombination 
of the gene of interest into the chromosome of the host organism). For example, bpr, 
5 vpr andyora^sequencescanbeusedashomologytargetsforrecombinationintothe 
host chromosome. It will further be appreciated by one of skill in the art that the design 
of a vector can be tailored depending on such factors as the choice of microorganism to 
be genetically engineered, the level of expression of gene product desired and the like. 

10 IV. Recombinant Microorganisms 

The present invention further features microorganisms, i.e., recombinant 
microorganisms, that include vectors or genes {e.g. , wild-type and/or mutated genes) as 
described herein. As used herein, the term "recombinant microorganism" includes a 
microorganism (e.g., bacteria, yeast cell, fungal cell, etc.) that has been genetically 
15 altered, modified or engineered (e.g., genetically engineered) such that it exhibits an 
altered, modified or different genotype and/or phenotype (e.g., when the genetic 
modification affects coding nucleic acid sequences of the microorganism) as compared 
to the naturally-occurring microorganism from which it was derived. 

In one embodiment, a recombinant microorganism of the present 
20 invention is a Gram positive organism (e.g., a microorganism which retains basic dye, 
for example, crystal violet, due to the presence of a Gram-positive wall surrounding the 
microorganism). In a preferred embodiment, the recombinant microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. In a more preferred 
25 embodiment, the recombinant microorganism is of the genus Bacillus. In another 
preferred embodiment, the recombinant microorganism is selected from the group 
consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, 
Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulans, 
Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, 
30 Bacillus thuringiensis, Bacillus halodurans, and other Group 1 Bacillus species, for 
example, as characterized by 16S rRNA type. In another preferred embodiment, the 
recombinant microorganism is Bacillus brevis or Bacillus stearothermophilus. In 
another preferred embodiment, the recombinant microorganism is selected from the 
group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus subtilis, 

35 and Bacillus pumilus. 

In another embodiment, the recombinant microorganism is a Gram 
negative (excludes basic dye) organism. In a preferred embodiment, the recombinant 
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microorganism is a microorganism belonging to a genus selected from the group 
consisting of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a-more 
preferred embodiment, the recombinant microorganism is of the genus Escherichia. In 
an even more preferred embodiment, the recombinant microorganism is Escherichia 
5 coli. In another embodiment, the recombinant microorganism is Saccharomyces (e.g., S. 
cerevisiae). 

A preferred "recombinant" microorganism of the present invention is a 
microorganism having a deregulated B6 vitamer biosynthesis pathway or enzyme. The 
term "deregulated" or "deregulation" includes the alteration or modification of at least 
10 one gene in a microorganism that encodes an enzyme in a biosynthetic pathway, such 
that the level or activity of the biosynthetic enzyme in the microorganism is altered or 
O modified. Preferably, at least one gene that encodes an enzyme in a biosynthetic 

W pathway is altered or modified such that the gene product is enhanced or increased. The 

5 phrase "deregulated pathway" can also include abiosynthetic pathway in which more 

3 1 5 than one gene that encodes an enzyme in a biosynthetic pathway is altered or modified 
Jj such that the level or activity of more than one biosynthetic enzyme is altered or 

T modified. The ability to "deregulate" a pathway (e.g., to simultaneously deregulate 

p more than one gene in a given biosynthetic pathway) in a microorganism in some cases 

pj arises from the particular phenomenon of microorganisms in which more than one 

W 20 enzyme (e.g., two or three biosynthetic enzymes) are encoded by. genes occurring 
g adjacent to one another on a contiguous piece of genetic material termed an "operon" 

(defined herein). Due to the coordinated regulation of genes included in an operon, 
alteration or modification of the single promoter and/or regulatory element can result in 
alteration or modification of the expression of more than one gene product encoded by 
25 the operon. Alteration or modification of the regulatory element can include, but is not 
limited to removing the endogenous promoter and/or regulatory element(s), adding 
strong promoters, inducible promoters or multiple promoters or removing regulatory 
sequences such that expression of the gene products is modified, modifying the 
chromosomal location of the operon, altering nucleic acid sequences adjacent to the 
30 operon or within the operon such as a ribosome binding site, increasing the copy number 
of the operon, modifying proteins (e.g. , regulatory proteins, suppressors, enhancers, 
transcriptional activators and the like) involved in transcription of the operon and/or 
translation of the gene products of the operon, or any other conventional means of 
deregulating expression of genes routine in the art (including but not limited to use of 
35 antisense nucleic acid molecules, for example, to block expression of repressor 

proteins). Deregulation can also involve altering the coding region of one or more genes 
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to yield, for example, an enzyme that is feedback resistant or has a higher or lower 
specific activity. 

In another preferred embodiment, a recombinant microorganism is 
designed or engineered such that at least one B6 vitamer biosynthetic enzyme, is 
overexpressed. The term "overexpressed" or "overexpression" includes expression of a 
gene product {e.g., a biosynthetic enzyme) at a level greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not 
been manipulated. In one embodiment, the microorganism can be genetically designed 
or engineered to overexpress a level of gene product greater than that expressed in a 
comparable microorganism which has not been engineered. 

Genetic engineering can include, but is not limited to, altering or 
modifying regulatory sequences or sites associated with expression of a particular gene 
(e-g-* by adding strong promoters, inducible promoters or multiple promoters or by 
removing regulatory sequences such that expression is constitutive), modifying the 
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a 
particular gene such as a ribosome binding site, increasing the copy number of a 
particular gene, modifying proteins (e.g., regulatory proteins, suppressors, enhancers, 
transcriptional activators and the like) involved in transcription of a particular gene 
and/or translation of a particular gene product, or any other conventional means of 
deregulating expression of a particular gene routine in the art (including but not limited 
to use of antisense nucleic acid molecules, for example, to block expression of repressor 
proteins). Genetic engineering can also include deletion of a gene, for example, to block 
a pathway or to remove a repressor. In embodiments featuring microorganisms having 
deleted genes, the skilled artisan will appreciate that at least low levels of certain 
compounds may be required to be present in or added to the culture medium in order 
that the viability of the microorganism is not compromised. Often, such low levels are 
present in complex culture media as routinely formulated. Moreover, in processes 
featuring culturing microorganisms having deleted genes cultured under conditions such 
that commercially or industrially attractive quantities of product are produced, it may be 
necessary to supplement culture media with slightly increased levels of certain 
compounds. 

In another embodiment, the microorganism can be physically or 
environmentally manipulated to overexpress a level of gene product greater than that 
expressed prior to manipulation of the microorganism or in a comparable microorganism 
which has not been manipulated. For example, a microorganism can be treated with or 
cultured in the presence of an agent known or suspected to increase transcription of a 
particular gene and/or translation of a particular gene product such that transcription 
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and/or translation are enhanced or increased. Alternatively, a microorganism can be 
cultured at a temperature selected to increase transcription of a particular gene and/or 
translation of a particular gene product such that transcription and/or translation are 
enhanced or increased. 

V. Culturing and Fermenting Recombinant Microorganisms 
The term "culturing" includes maintaining and/or growing a living 
microorganism of the present invention (e.g., maintaining and/or growing a culture or 
strain). In one embodiment, a microorganism of the invention is cultured in liquid 
media. In another embodiment, a microorganism of the invention is cultured in solid 
media or semi-solid media. In a preferred embodiment, a microorganism of the 
invention is cultured in media (e.g. , a sterile, liquid media) comprising nutrients 
essential or beneficial to the maintenance and/or growth of the microorganism (e.g., 
carbon sources or carbon substrate, for example carbohydrate, hydrocarbons, oils, fats, 
fatty acids, organic acids, and alcohols; nitrogen sources, for example, peptone, yeast 
extracts, meat extracts, malt extracts, soy flour, urea, ammonium sulfate, ammonium 
chloride, ammonium nitrate and ammonium phosphate; phosphorus sources, for 
example, phosphoric acid, sodium and potassium salts thereof; trace elements, for 
example, magnesium, iron, manganese, calcium, copper, zinc, boron, molybdenum, 
and/or cobalt salts; as well as growth factors such as amino acids, vitamins, and the 
like). 

Preferably, microorganisms of the present invention are cultured under 
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product. In one embodiment microorganisms are cultured at a pH of about 7. 
In another embodiment, microorganisms are cultured at a pH of between 6.0 and 8.5. 
The desired pH may be maintained by any number of methods known to those skilled in 
the art. 

Also preferably, microorganisms of the present invention are cultured 
under controlled aeration. The term "controlled aeration" includes sufficient aeration 
(e.g., oxygen) to result in production of the desired product. In one embodiment, 
aeration is controlled by regulating oxygen levels in the culture, for example, by 
regulating the amount of oxygen dissolved in culture media. Preferably, aeration of the 
culture is controlled by agitating the culture. Agitation may be provided by a propeller 
or similar mechanical agitation equipment, by revolving or shaking the culture vessel 
(e.g. , tube or flask) or by various pumping equipment. Aeration may be further 
controlled by the passage of sterile air or oxygen through the medium (e.g. , through the 
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. fermentation mixture). Also preferably, microorganisms of the present invention are 
cultured without excess foaming (e.g., via addition of anufoaming agents). 

Moreover, microorganisms of the present invention can be cultured under 
controlled temperatures. The term "controlled temperature" includes any temperature 
5 which results in production of the desired product (e.g., a B6 vitamer). In one 

embodiment, controlled temperatures include temperatures between 15°C and 95°C. In 
another embodiment, controlled temperatures include temperatures between 15°C and 
70°C. Preferred temperatures are between 20°C and 55°C, more preferably between 
30°Cand50°C. 

1 0 Microorganisms can be cultured (e.g. , maintained and/or grown) in liquid 

media and preferably are cultured, either continuously or interrnittently, by conventional 
culturing methods such as standing culture, test tube culture, shaking culture (e.g. , rotary 
shaking culture, shake flask culture, etc.), aeration spinner culture, or fermentation. In a 
preferred embodiment, the microorganisms are cultured in shake flasks. In a more 
1 5 preferred embodiment, the microorganisms are cultured in a fermentor (e.g., a 

fermentation process). Fermentation processes of the present invention include, but are 
not limited to, batch, fed-batch and continuous processes or methods of fermentation. 
The phrase "batch process" or "batch fermentation" refers to a closed system in which 
the composition of media, nutrients, supplemental additives and the like is set at the 
20 beginning of the fermentation and not subject to alteration during the fermentation, 

however, attempts may be made to control such factors as pH and oxygen concentration 
to prevent excess media acidification and/or microorganism death. The phrase "fed- 
batch process" or "fed-batch" fermentation refers to a batch fermentation with the 
exception that one or more substrates or supplements are added (e.g., added in 
25 increments or continuously) as the fermentation progresses. The phrase "continuous 
process" or "continuous fermentation" refers to a system in which a defined 
fermentation media is added continuously to a fermentor and an equal amount of used or 
"conditioned" media is simultaneously removed, preferably for recovery of the desired 
product (e.g. , a B6 vitamer). A variety of such processes have been developed and are 
30 well-known in the art. 

The phrase "culturing under conditions such that a desired compound is 
produced" includes maintaining and/or growing microorganisms under conditions (e.g., 
. temperature, pressure, pH, duration, etc.) appropriate or sufficient to obtain production 
of the desired compound or to obtain desired yields of the particular compound being 
35 produced. For example, culturing is continued for a time sufficient to produce the 
desired amount of a compound (e.g, aB6 vitamer). Preferably, culturing is continued 
for a time sufficient to substantially reach suitable production of the compound (e.g., a 
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time sufficient to reach a suitable concentration of a B6 vitamer). In one embodiment, 
culturing is continued for about 12 to 24 hours. In another embodiment, culturing is 
continued for about 24 to 36 hours, 36 to 48 hours, 48 to 72 hours, 72 to 96 hours, 96 to 
120 hours, 120 to 144 hours, or greater than 144 hours. The methodology of the present 
invention can further include a step of recovering a desired compound (e.g., a B6 
vitamer). The term "recovering" a desired compound includes extracting, harvesting, 
isolating or purifying the compound from culture media. Recovering the compound can 
be performed according to any conventional isolation or purification methodology 
' known in the art including, but not limited to, treatment with a conventional resin (e.g., 
anion or cation exchange resin, non-ionic adsorption resin, etc.), treatment with a 
conventional adsorbent (e.g., activated charcoal, silicic acid, silica gel, cellulose, 
alumina, etc.), alteration of pH, solvent extraction (e.g., with a conventional sblvent such 
as an alcohol, ethyl acetate, hexane and the like), dialysis, filtration, concentration, 
crystallization, recrystallization, pH adjustment, lyophilization and the like. For 
example, a compound can be recovered from culture media by first removing the 
microorganisms from the culture. The resulting solutions are then passed through or 
over a cation exchange resin to remove cations and/or through or over an anion 
exchange resin to purify or concentrate the desired product. The resulting compound can 
subsequently be converted to a salt (e.g., a chloride or sulfate salt) by ion exchange. 

Preferably, a desired compound of the present invention is "extracted," 
"isolated" or "purified" such that the resulting preparation is substantially free of other 
media components (e.g., free of media components and/or fermentation byproducts). 
The language "substantially free of other media components" includes preparations of 
the desired compound in which the compound is separated from media components or 
fermentation byproducts of the culture from which it is produced. In one embodiment, 
the preparation has greater than about 80% (by dry weight) of the desired compound 
(e.g., less than about 20% of other media components or fermentation byproducts), more 
preferably greater than about 90% of the desired compound (e.g., less than about 10% of 
other media components or fermentation byproducts), still more preferably greater than 
about 95% of the desired compound (e.g., less than about 5% of other media 
components or fermentation byproducts), and most preferably greater than about 98- 
99% desired compound (e.g., less than about 1-2% other media components or 
fermentation byproducts). When the desired compound has been derivatized to a salt, 
the compound is preferably further free of chemical contaminants associated with the 
formation of the salt. When the desired compound has been derivatized to an alcohol, 
the compound is preferably further free of chemical contaminants associated with the 
formation of the alcohol. 
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In an alternative embodiment, the desired compound is not purified from 
the microorganism, for example, when the microorganism is biologically non-hazardous 
(e.g. , safe). For example, the entire culture (or culture supernatant) can be used as a 
source of product (e.g., crude product). In one embodiment, the culture (or culture 
supernatant) is used without modification. In another embodiment, the culture (or 
culture supernatant) is concentrated. In yet another embodiment, the culture (or culture 
supernatant) is dried or lyophilized. 

Preferably, a production method of the present invention results in 
production of the desired compound, e.g., a B6 vitamer, at a significantly high yield. 
The phrase "significantly high yield" includes a level of production or yield which is 
sufficiently elevated or above what is usual for comparable production methods, for 
example, which is elevated to a level sufficient for commercial production of the desired 
product (e.g. , production of the product at a commercially feasible cost). In one 
embodiment, the invention features a production method that includes culturing a 
recombinant microorganism under conditions such that the desired product (e.g., a B6 
vitamer) is produced at a level greater than 5 mg/L. In another embodiment, the 
invention features a production method that includes culturing a recombinant 
microorganism under conditions such that the desired product (e.g., a B6 vitamer) is 
produced at a level greater than 1 0 mg/L. In another embodiment, the invention features 
a production method that includes culturing a recombinant microorganism under 
conditions such that the desired product (e.g., a B6 vitamer) is produced at a level 
greater than 50 mg/L. In yet another embodiment, the invention features a production 
method that includes culturing a recombinant microorganism under conditions such that 
the desired product (e.g., a B6 vitamer) is produced at a level greater than 150 mg/L. 

Depending on the biosynthetic enzyme or combination of biosynthetic 
enzymes manipulated, it may be desirable or necessary to provide (e.g., feed) 
microorganisms of the present invention at least one biosynthetic precursor such that the 
desired compound or compounds are produced. The term "biosynthetic precursor 5 * or 
"precursor" includes an agent or compound which, when provided to, brought into 
contact with, or included in the culture medium of a microorganism, serves to enhance 
or increase biosynthesis of the desired product. In one embodiment, the biosynthetic 
precursor or precursor is glutamine. In another embodiment, the biosynthetic precursor 
or precursor is ribose. The amount of glutamine or ribose added is preferably an amount 
that results in a concentration in the culture medium sufficient to enhance productivity 
of the microorganism (e.g., a concentration sufficient to enhance production of a B6 
vitamer). The term "excess ribose or glutamine" includes ribose or glutamine levels 
increased or higher that those routinely utilized for culturing the microorganism in 
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question. For example, culturing the Bacillus microorganisms described in the instant 
Examples is routinely done in the presence of about 0-5 g/L ribose or glutamine: 
Accordingly, excess ribose or glutamine levels can include levels of about 5-10 g/L or 
more preferably about 5-20 g/L ribose or glutamine. Biosynthetic precursors of the 
present invention can be added in the form of a concentrated solution or suspension 
[e.g., in a suitable solvent such as water or buffer) or in the form of a solid (e.g., in the 
form of a powder). Moreover, biosynthetic precursors of the present invention can be 
added as a single aliquot, continuously or intermittently over a given period of time. 



This invention is further illustrated by the following examples which 
should not be construed as limiting. The contents of all references, patents and 
published patent applications cited throughout this application are incorporated herein 
by reference. 
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EXAMPLES 

EXAMPLE 1 : Biological assay for B6 vitamers using Saccharomyces uvarum. 

Quantitation of B6 vitamers in supernatants of cultures of micro- 
organisms or extracts of organisms that have been genetically modified to increase 
production of B6 vitamers is conveniently done using Saccharomyces uvarum (formerly 
and still often named S. carlsbergensis) strain ATCC 9080 as the indicator strain or test 
organism. The method is essentially that described in the Difco Manual (1984, Difco 
Laboratories, Detroit, MI, 10th Edition, pp. 1 104-1 106), with the modification that 50 
mg/liter of streptomycin sulfate is added to the liquid growth medium for the test 
organism. However, any other appropriate indicator organism may be used, together 
with a medium that is appropriate for that organism that is free of B6 vitamers. For 
example, an R colipdxB mutant can be used in a standard minimal medium that is well 
known in the art, such as M9 glucose minimal medium (Miller, J., (1972) Experiments 
in Molecular Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY). 

When using & uvarum strain ATCC 9080 as the indicator strain, Bacto 
Pyridoxine Y Medium (Difco Laboratories, available through VWR Scientific, Inc.), 
supplemented with 50 mg/liter streptomycin sulfate, is used for the serial dilutions, and 
PN, PL, or PM is used to generate the standard curve. The responses to these three 
standard compounds are almost identical to each other with S. uvarum strain ATCC 
9080 (Figure 3). 

EXAMPLE 2: Deletion of a portion of the yaaDE operon in JJ. subtilis. 

The SOR and SNO genes of Cercospora nicotianae were originally 
identified by mutations that lead to hypersensitivity to singlet oxygen-generating 
reagents (Ehrenschaft, M., et al (1999) Proc. Natl Acad Set USA 96: 9347-9378). 
Mutations in either of these genes also lead to PL auxotrophy. The protein sequences 
obtained from translation of the SOR and SNO open reading frames were used as 
homology probes to search through the B. subtilis genome sequence using the BLAST 
homology search program of the Subtilist website. The SOR protein was significantly 
homologous to the YaaD protein, and the SNO protein was significantly homologous to 
the YaaE protein. Moreover, the genes encoding the YaaD and YaaE proteins (namely 
yaaD andyaaE) occur adjacent to each other on the B. subtilis chromosome as a two 
gene operon. 

General methods for growth, storage, transformation, and molecular 
biology o/B. subtilis strains are given in Harwood, C, and Cutting, S. (1 990), Molecular 
Biological Methods for Bacillus, John Wiley and Sons, New York, NY, hereby 
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incorporated in its entirety by reference. The yaaDE operon DNA sequence was 
amplified using the Polymerase Chain Reaction (PCR) with Pfo Turbo DNA polymerase 
(Stratagene, Inc., used according to the manufacturer's instructions). The DNA primers 
used were RY395 (SEQ ID NO:l) and RY396 (SEQ ID NO:2). RY395, the upstream 
primer, introduces an Xbalsite and artificial ribosome binding site. RY396, the 
downstream primer, introduces a BamHJ site. The template DNA was chromosomal 
DNA isolated form wild type B. subtilis strain PY79. The blunt ended PCR product was 
ligated into the EcoRV site of pGEM5Zf(+) (Prdmega, Inc.) to give plasmid pAN368. 
Next, a gram positive chloramphenicol resistance gene on a blunt DNA fragment was 
ligated into pAN368 that had been cut with Hpal, to give plasmid pDXIF (SEQ ID 
NO:5, Figure 4). pDXIF therefore is deleted for a portion of yaaD and a portion of 
yaaE. pDXIF was used to transform wild type B. subtilis strain PY79 to 5 mg/liter 
chloramphenicol resistance, and a double crossover event was confirmed using PCR and 
the same primers used to clone yaaDE: The resulting strain was named PX1. 

PX1 was able to grow on Spizizen's minimal medium with trace 
elements (SMM) (Harwood, C, and Cutting, S. (1990) Molecular Biological Methods 
for Bacillus, John Wiley and Sons, New York, NY, pp. 548-549) supplemented with 2 
\xM pyridoxal HC1 (Sigma- Aldrich Chemical Co.), but it did not grow without the 
supplement. Thus it was established that at least one of yaaD ox yaaE is required for 
PLP synthesis in B. subtilis. 

EXAMPLE 3: Deletion of yhaF in B. subtilis. 

The protein sequence of the E. colipdxF gene was used as a probe to 
search the B. subtilis genome as described in Example 1. The only significant homolog 
was yhaF. In a fashion similar to that of Example 1 , the yhaF was cloned and deleted 
from the chromosome of PY79 using plasmid pDXl IF (SEQ ID NO:6, Figure 5), to 
give strain PX1 1. The PCR primers used to clone yhaF were RY407 (SEQ ID NO:3) 
and RY408 (SEQ ID NO:4). The restriction sites used for insertion of the antibiotic 
resistance gene were the PshAl and Ehel sites in the yhaF coding region. PX1 1 is a 
serine auxotroph, but not a PL autotroph. By comparison to E. coli, it appears that yhaF 
functions in serine synthesis and probably encodes the equivalent of SerC, but that the 
YhaF protein is not required for PLP synthesis in B. subtilis. Therefore, it is established 
that sequence homology does not necessarily imply functional homology. 

EXAMPLE 4: Overexpression of the yaaDE operon in B. subtilis. 

The Xbal to BamHI fragment from pAN368 that contains the yaaDE 
operon and artificial ribosome binding site was inserted into either of two expression 
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vectors, to yield plasmids pDX14R (SEQ ID NO:7) and pDXITR (SEQ ID NO:8), 
respectively. In pDX14R and pDX17R, the yaaDE operon is expressed from the strong 
constitutive B. subtilis phage SP01 promoters, P 26 and P l5 , respectively (see Figures 6 
and 7). 

5 pDX14R and pDX17R were each transformed into wild type B. subtilis 

strain PY79, selecting for chloramphenicol resistance. The plasmids integrate into the 
chromosome at the yaaDE locus by single crossover. The resulting strains were named 

PX14 and PX17, respectively. 

PX14 and PX17 were grown for 48 hours at 37°C in 5 ml test tube 

10 culturesinarollerdrumataboutlOOrotationsperminute. The culture medium was 
SVY (20 g Difco Veal Infusion Broth, 5 g Difco Yeast Extract, 2 g ammonium sulfate, 5 
g sodium glutamate, and 30 g glucose per liter, buffered with 200 mM potassium 
phosphate, pH 7.0). Cells were removed by centrifugation followed by sterile filtration 
(Millipore'o.45 micron), and the supernatant solutions were assayed for PL equivalents 

15 usingthebiologicalassaydescribedinExamplel. The parent strain, PY79 was grown 
and processed in similar fashion as a control. The uncultured SVY medium was assayed 
as another control, since it was likely that the SVY medium contained a measurable 
level of B6 vitamers. The results are shown in Table 2, below. 

20 Table 2: Production ofB6 vitamers by Bacillus subtUits derivatives in 48 
hour test tube cultures grown in SVY 



Strain 



PX14 
PX17 
PX1 
PY79 



Cassette 



P 26 yaaDE 
PisyaaDE 
AyaaDE 



Integration 
Target 



yaaDE 
yaaDE 
yaaDE 



OD 6 oo 


Total B6 
Vitamers' 
. mg/liter 


NetB6 
Vitamers 2 
mg/liter 


17 


7.2 


7.0 


17 


4.9 


4.7 


8 


0.4 


0.2 


19 


0.8 


0.6 


0.08 


0.2 


(0) 



25 



(Medium) , — - . 

1 Sum of PN, PL, PM, and derivatives thereof that can be utilized by pyridoxin indicator strain S. 
carlbergensis as a source of vitamin B6 for growth. 

2 Calculated by subtracting the amount assayed in the medium. 



After subtracting the B6 vitamers contained in the medium, strain PX14 
produced 7.0 mgAiter PL equivalents, while the parent PY79 produced only 0.6 mgAiter 
of PL equivalents. Thus, expression of iheyaaDE operon has been shown to be rate 
30 lmntmgforB6vitamerproductionin5.^ife. Moreover, a genetically modified 
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strain, where this rate limiting step was enhanced, produced more than a ten-fold 
increase in B6 vitamer secretion compared to that of the parent. 

EXAMPLE 5: Complementation o/E. coli pdx mutants byplasmids mat 
5 express the B. subtilis yaaDE operon. 

Plasmid pDX14R, described above in Example 4, was used to transform 
various K coli strains that contained mutations that lacked function in each of the'known 
genes involved in PLP biosyntehsis (except for dxs, which is an essential gene for E. 
colt). The selection was for resistance to 250 mg/liter ampicillin. Each transformant 
10 was then tested for growth on niinimal medium (SMM with 0.5 % glucose, see Example 
2) supplemented with 100 mg/liter serine, and compared to growth of its respective 
untransformed parent on the same medium. All mutations tested were complemented by 
pDX14R. Specifically,/^, pdxB, pdxF, pdxJ, mdpdxH, were all complemented by 
pDX14R. Therefore, expression of the B. subtilis yaaDE operon in E. coli is sufficient 
15 for PLP biosynthesis, even in the absence of any one of the above functional^ genes. 
Several important and unexpected conclusions or inferences can be drawn form these 
results. First, the substrate(s) for the enzyme(s) encoded by yaaDE must be present in 
E coli, even when a biosynthetic intermediate normally used to make PLP is absent or 
greatly reduced. Second, PNP or PLP is possibly the product of the enzyme(s) encoded 
20 by yaaDE. Third, since an early block in the E. coli PLP biosynthetic pathway (for 
example that in a pdxB mutant) does not prevent yaaDE from complementation, the 
substrates for the en2yme(s) encoded by yaaDE are not likely to be the same as for the 
last step in PNP or PLP synthesis in wild type E. coli. These unexpected results lead to 
the possibility of producing B6 vitamers using B. subtilis yaaDE or the homologous 
25 genes from another organism (for example, but not limited to, SOR and SNO from 
Cercospora nicotianae or PDX1 and PDX2 from S. cerevisiae) in a heterologous host 
species, including, but not limited to, E. coli and Oryza sativa. 

EXAMPLE 6: Overexpression of the yaaDE operon in E coli 
30 Plasmids pDX14R and pDX17R were transformed into E. coli strain 

DH5ct (New England Biolabs), selecting for ampicillin resistance. The transformants 
were grown for 48 hours in 5 ml test tube cultures at 37°C, and the supernatants were 
worked up as in Example 3. The assay results for PL equivalents are shown in Table 3, 
(below. 

35 
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Table 3: Production of B6 vitamers by Escherichia coli harboring plasmids 

containing engineered Bacillus subtilits genes 



Strain 



DH5o 
DH5a 
DH5a 



Plasmid 
Cassette 



P 2 Q>aaDE 
PisyaaDE 



OD 60 o 



7.6 
8.2 
9 



Total B6 
Vitamers 2 
mg/liter 



3.2 
3.2 
0.1 



NetB6 
Vitamers 3 
mg/liter 



3.1 
3.1 

(0) 



'£ coli test tube cultures are grown in SVY for 48 hours. 
2 Sum of PN, PL, PM, and derivatives thereof that can be utilized by pyridoxme indicator strain a. 
5 carlbergensis as a source of vitamin B6 for growth. 

3 Calculated by subtracting the amount assayed in DH5ct not containing plasmid. 

Thus it has been shown that the yaaD and yaaE genes can be expressed 
in a heterologous host strain, and B6 vitamers can still be overproduced. By extension 
10 of this approach, the yaaD and yaaE genes of B. subtilis can be overexpressed in any 
organism where an overexpression system exists, and in the resulting strains, B6 
vitamers will be overproduced. Overproduction of the rate limiting enzyme for B6 
vitamer production in any organism that is capable of producing B6 vitamers will lead to 
overproduction of B6 vitamers. 
1 5 The YaaD and YaaE protein sequences were used as probes to search the 

NCBI database for homologs in plants using the BLAST™ program which can be found 
at the National Center for Biotechnology Information website (Altschul S.F (1990) 
J. Mol. Biol. 215(3):403-10). Several homologs of YaaD were found in several genera 
of plants, including Arabidopsis, Oryza, Ginkgo, Hevea, Phaseolus, m&Stellaria. Two 
20 homologs of YaaE were found in Arabidopsis thaliana. However, no homologs of pdxA 
wd pdxJv/eie found. Therefore the plant kingdom appears to use the Type B Pathway 
for B6 vitamer biosynthesis. Thus for example, overexpression of the YaaD homolog 
(GenBank accession number AAL73561) from Oryza sativa (rice), and the A. thaliana 
homolog of YaaE (GenBank accession number AB01 1483) together in a plant using 
25 methods well know in the art, such as expression from the Cauliflower Mosaic Virus 
35S promoter, will lead to overproduction of B6 vitamers in that plant. 

EXAMPLE 7: Other routes to increasing the activity of enzymes involved in 
B6 vitamer synthesis. 
30 The overexpression of the yaaDE operon leads to an increase in the 

amount of the encoded enzyme(s), which in turn leads to an increase in the total activity 
of said enzyme(s). Increase in this activity leads to an increase in the production B6 
vitamer. Other methods can be used to increase the activity of the relevant enzyme(s) 
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under conditions of B6 vitamer production. For example, in addition to increasing the 
amount of a relevant enzyme(s), the total activity of the relevant enzyme(s) can be 
increased by mutating the gene(s) to increase the specific activity of the enzyme(s), 
and/or by mutating the gene(s) to encode a feedback resistant variant of the enzyme(s). 

5 Such desirable mutations can be obtained by screening large numbers of mutants for the 
increased activity as evidenced by an increase in B6 vitamer production as described^ 
Example 4, or by selecting for mutants that are resistant to inhibitors that are specific for 
the PLP biosynthetic pathway, and screening among those mutants for an increase in B6 
vitamer production. Examples of such inhibitors are isoniazid, iproniazid, and 

10 ginkgotoxin (4'-methoxy pyridoxine) (Dempsey and Arcement (1971) J. Bacteriol. 
107(2): 580-582; Pflug, W., and Lingens, F., (1978) Hoppe-Seyler's Z. Physiol. Chem. 
359: 559-570; Fiehe, K., et al., (2000) J. Nat. Prod. 63(2): 1 85-189). 



1 5 EXAMPLE 8: Processing of biosynthetic B6 vitamers. 

A B6 vitamer produced by a genetically modified organism of the 
invention can be harvested and processed into a format that is appropriate for 
commercial use. For example, after culturing a B6 vitamer producing micro-organism in 
liquid culture, the entire culture, including cells can be dried by evaporation or by spray 
20 drying, and the resulting powder can be mixed into animal feeds. Alternatively, the cells 
can be first removed by centrifugation or filtration, and the resulting supernatant 
solution can be dried as described above. As another alternative, the B6 vitamer can be 
purified from the culture broth by techniques well known in the art, such as filtration, 
reverse osmosis, column chromatography (ion exchange, hydrophobic or hyrophilic 
25 adsorption, gel filtration, etc.), solvent extraction, precipitation, distillation, evaporation, 
and the like. If the B6 vitamer producing organism is a plant, then the appropriate 
portion of the plant (for example the leaves, stems, roots, flowers, fruits, seeds, or any 
combination thereof) can be harvested and processed. For example the plant material 
can be dried and used directly, or the material can be pulverized or ground and the B6 
30 vitamer extracted and/or processed as described above for cultures. 

The production organism can be a micro-organism that normally inhabits 
the gut of humans or an animal if interest (for example one of many bacteria of the 
genus Lactobacillus, such as L. acidophilus ), and the B6 vitamer can be delivered by 
ingestion of the organism. 

35 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
following claims. 
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What is claimed: 

1 . An organism that has been genetically modified to comprise a 
recombinant DNA molecule that results in the increase of the activity of one or more 

5 enzymes that catalyze(s) a step in the biosynthesis of a B6 vitamer, such that B6 vitamer 
production from said modified organism is increased compared to B6 production in an 
unmodified parent organism. 

2. The organism of claim 1 , wherein p6 vitamer production is at 
1 0 least ten-fold higher than from the unmodified parent organism. 

3 . The organism of claim 1 , wherein said enzyme is one or more of 
YaaDorYaaE. 

15 4. The organism of claim 1 , wherein said organism is genetically 

modified to overexpress one or more genes that encodes an enzyme that catalyzes a step 
in the biosynthesis of a B6 vitamer. 

5. The organism of claim 4, wherein at least one of said genes is a 

20 yaaD gene. 

6. The organism of claim 4, wherein at least one of said genes is a 

yaaE gene. 

25 7. The organism of claim 4, wherein at least two of said genes are 

yaaD and yaaE genes. 

8. The organism of claim 4, wherein said organism is a Bacillus 

strain. 

30 

9. The organism of claim 4, wherein said organism is Bacillus 

subtilis. 



35 io. The organism of claim 4, wherein said organism is Escherichia 

coli. 
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.11. The organism of claim 1 0, wherein said genes are selected from 
the group consisting of E. coli epd, pdxA, pdxJ, pdxF, pdxB,pdxH or dxs. 

12. The organism of any one of claims 1-1 1, wherein said organism is 
5 grown in a liquid culture and the total B6 vitamer concentration in the culture 

supernatant it at least 7.0 mg/liter. 

13. A method of producing a B6 vitamer comprising culturing a 
microorganism that has been genetically modified to overexpress one or more genes that 

1 0 encodes an enzyme that catalyzes a step in the biosynthesis of a B6 vitamer, such that 
B6 vitamer production from said modified organism is increased compared to B6 
production in an unmodified parent organism, under conditions such that the B6 vitamer 
is produced. 

15 14. The method of claim 13, wherein said enzyme is one or more of 

YaaDorYaaE. 



yaaD gene. 

20 



yaaE gene. 



15. The method of claim 13, wherein at least one of said genes is a 



16. The method of claim 13, wherein at least one of said genes is a 



17. The method of claim 13, wherein said genes are contained on the 



25 yaaDE operon. 



30 



35 



derived. 



Bacillus. 



18. The method of claim 13, wherein the B6 vitamer is pyridoxins 

1 9. The method of claim 13, wherein the B6 vitamer is pyridoxal. 

20. The method of claim 13, wherein the B6 vitamer is pyridoxamine. 

21. The method of claim 1 3, wherein the said genes are bacterial- 



22. The method of claim 1 3, wherein said genes are derived from 
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23. The method of claim 13, wherein said genes are derived from 
Bacillus subtilis. 

5 24. The method of claim 1 3 , wherein the microorganism is Gram 

positive. 

25. The method of claim 13, wherein the microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
1 0 Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. 

^ 26 The method of claim 13, wherein the microorganism is of the 

y genus Bacillus. 

m 

«j 15 27. The method of claim 1 3, wherein the microorganism is Bacillus 

00 subtilis. 

□ 28. The method of claim 1 3, further comprising recovering the B6 

W vitamer. 

PJ 

ry 20 

O 29. A method of producing a B6 vitamer comprising culturmg a 

W microorganism that overexpresses at least one Bacillus B6 vitamer biosynthetic gene 

under conditions such that the B6 vitamer is produced. 

25 30. The method of claim 29, wherein the microorganism 

overexpresses at least one Bacillus subtilis B6 vitamer biosynthetic enzyme. 

3 1 . The method of claim 29, wherein the B6 vitamer is pyridoxine. 

30 32. The method of claim 29, wherein the B6 vitamer is pyridoxal. 

33. The method of claim 29, wherein the B6 vitamer is pyridoxamine. 



35 



34. The method of claim 29, wherein the microorganism 
overexpresses at least two B6 vitamer biosynthetic enzymes. 
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35. The method of claim 29, wherein the microorganism is Gram 

positive. 

36. The method of claim 29, wherein the microorganism is Gram 

5 negative. 

37. The method of claim 29, wherein the microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. 

10 

38. The method of claim 29, wherein the microorganism is of the 
genus Bacillus. 

39. The method of claim 29, wherein the microorganism is Bacillus 

15 subtilis. 

40. The method of claim 29, further comprising recovering the B6 

vitamer. 

20 4 1 . A recombinant microorganism that overexpresses at least one 

Bacillus B6 vitamer biosynthetic gene. 

42. A recombinant microorganism that overexpresses at least one 
Bacillus B6 vitamer biosynthetic enzyme. 

25 

43 . The method of claim 42, wherein said enzyme is YaaD or YaaE. 

44. The recombinant microorganism of claim 41 that overexpresses at 
least one Bacillus subtilis B6 vitamer biosynthetic gene. 

30 

45. The recombinant microorganism of claim 41, wherein the B6 
vitamer biosynthetic gene is selected from the group consisting of yaaD zndyaaE. 

46. The recombinant microorganism of claim 41, that is Gram 

35 positive. 
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47. The recombinant microorganism of claim 41 belonging to a genus 
selected from the group consisting of Bacillus, Cornyebacterium, Lactobacillus, 
Lactococci and Streptomyces. 

5 48. The recombinant microorganism of claim 41 belonging to the 

genus Bacillus. 

49. The recombinant microorganism of claim 4 1 which is Bacillus 

subtilis. 

10 

50. A recombinant microorganism selected from the group consisting 
ofPX14andPX17. 

m 
s 

yj 5 1 . A vector comprising a nucleic acid sequence that encodes at least 

51 1 5 one Bacillus B6 vitamer biosynthetic gene operably linked to regulatory sequences. 

SI 

o 

K 52. The vector of claim 5 1 , comprising a nucleic acid sequence that 

y encodes at least one Bacillus subtilis B6 vitamer biosynthetic gene. 

O 

W 20 53 . The vector of claim 5 1 , wherein the regulatory sequences 

fy comprise a constitutively active promoter. 

S 

Fa 54. The vector of claim 5 1 , wherein the constitutively active promoter 

comprises P J5 (SEQ ID NO:9) or P 26 (SEQ ID NO:10) sequences. 

25 

55. The vector of claim 51, wherein the regulatory sequences 
comprise at least one artificial ribosome binding site (RBS). 

56. A vector selected from the group consisting of pDXHR and 

30 pDX17R. 

57. A recombinant microorganism comprising the vector of claim 56. 



58. An isolated nucleic acid molecule that encodes at least one 
35 Bacillus B6 vitamer biosynthetic gene. 
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59. The isolated nucleic acid molecule of claim 58 that encodes at 
least one Bacillus subtilis B6 vitamer biosynthetic gene. 

60. An isolated Bacillus B6 vitamer biosynthetic enzyme polypeptide. 

61 . An isolated Bacillus subtilis B6 vitamer biosynthetic enzyme 

polypeptide. 
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METHODS AND ORGANISMS FOR 
PRODUCTION OF B6 VITAMERS 

5 Abstract of the Disclosure 

The present invention features methods of producing B6 vitamers that 
involve culturing an organism overexpressing an enzyme that catalyzes a step in the 
biosynthesis of a B6 vitamer under conditions such that a B6 vitamer is produced. The 
1 0 present invention further features methods of producing B6 vitamers that involve 
culturing recombinant microorganisms that overexpress at least one B6 vitamer 
biosynthetic gene, e.g., yaaD oxyaaE. 
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FIGURE 7. Structure ofpDX17R 




Atty. Docket No.: BGM52-1 



1 



SEQUENCE LISTING 

<110> Yocura, R. Rogers 
Williams , Mark K. 
Pero, Janice G. 

<120> METHODS AND ORGANISMS FOR PRODUCTION OF B6 VITAMERS 

<130> BGI-152-1 

<160> 19 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 1 

ccctctagag gaggagaaaa catggctcaa acaggtactg aacgtg 

<210> 2 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 2 

cccggatcct caactgtttt atacaagtgc cttttgctta tattc 

<210> 3 
<211> 49 
<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> primer 
<400> 3 

ccctctagag gaggagacat aatggaacgt acaacgaatt ttaacgcag 

<210> 4 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 4 

cccggatccc ggacggtttg catagccaga ctttttactc a 

<210> 5 
<211> 5322 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> plasmid - pDXIF 
<400> 5 

tgcgccgcta cagggcgcgt ccattcgcca 
tcggtgcggg cctcttcgct attacgccag 
ttaagttggg taacgccagg gttttcccag 
ttgtaatacg actcactata gggcgaattg 
ggccgcggga tccctctaga ggaggagaaa 
aacgcggaat ggcagaaatg caaaaaggcg 
aagcgaaaat cgctgaagaa gctggagctg 
cagatattcg cgcggctgga ggagttgccc 
taatgaatgc agtatctatc ccggtaatgg 
cgcgtgtgct tgaagctatg ggtgttgact 
ctgacgaaga atttcattta aataaaaatg 
gtgatcttgg tgaagcaaca cgccgtattg 
gtgagcctgg aacaggtaat attgttgagg 
gtgcggccgc ctgcaggtcg accatatggg 
ttattatcaa gataagaaag aaaaggattt 
caaaagacca cattttttaa tgtggtcttt 
caaacgaaaa ttggataaag tgggatattt 
tgacttttaa aaaaggattg attctaatga 
actttagata aaaatttagg aggcatatca 
ttggaagaga aaagagatat ttaatcatta 
cacagaaatt gatattagtg ttttataccg 
ccctgcattt attttcttag tgacaagggt 
ttacaatagc gacggagagt taggttattg 
tgatggtgta tctaaaacat tctctggtat 
gttttatgat ttataccttt ctgatgtaga 
caaaacacct atacctgaaa atgctttttc 
tgggtttaac ttaaatatca ataataatag 
aaaattcatt aataaaggta attcaatata 
tgtttgtgat ggttatcatg caggattgtt 
gcctaatgac tggcttttat aatatgagat 
tctaatgtca ctaacctgcc ccgttagttg 
cgggctccca acgcgttgga tgcatagctt 
ggcgtaatca tggtcatagc tgtttcctgt 
taggacttca aggagcagtt agagagcaca 
gtcttgtcgt aaaacgtccg gagcagctga 
gtgagagcac gacgatgcgc cgtttgatcg 
aattcgctgc tcagggcaaa ccgatgtttg 
aagaaattgc cggttcagat aatcctcatt 
attcatttgg ccggcaggtt gacagctttg 
agccttttac tggggtattc atccgtgctc 
aagttctatc ggagcataat ggtcgtattg 
gctcattcca tccggagctg acagaagatc 
ttgaggaata taagcaaaag gcacttgtat 
cggccgcctg caggtcgacc atatgggaga 
tattctatag tgtcacctaa atagcttggc 
aaattgttat ccgctcacaa ttccacacaa 
ctggggtgcc taatgagtga gctaactcac 
ccagtcggga aacctgtcgt gccagctgca 
cggtttgcgt attgggcgct cttccgcttc 
tcggctgcgg cgagcggtat cagctcactc 
aggggataac gcaggaaaga acatgtgagc 
aaaggccgcg ttgctggcgt ttttcgatag 
tcgacgctca agtcagaggt ggcgaaaccc 
ccctggaagc tccctcgtgc gctctcctgt 
cgcctttctc ccttcgggaa gcgtggcgct 



ttcaggctgc gcaactgttg ggaagggcga 60 
ctggcgaaag ggggatgtgc tgcaaggcga 120 
tcacgacgtt gtaaaacgac ggccagtgaa 180 
ggcccgacgt cgcatgctcc cggccgccat 240 
acatggctca aacaggtact gaacgtgtaa 300 
gcgtcatcat ggacgtcatc aatgcggaac 360 
tcgctgtaat ggcgctagaa cgtgtgccag 420 
gtatggctga ccctacaatc gtggaagaag 480 
caaaagcgcg tatcggacat attgttgaag 540 
atattgatga aagtgaagtt ctgacgccgg 600 
aatacacagt tccttttgtc tgtggctgcc 660 
cggaaggtgc ttctatgctt cgcacaaaag ■ 720 
ctgttcgcca tatgcgtaaa gttatcacta 780 
agaggcggcc gcgtcgacca atagttaccc 840 
ttcgctacgc tcaaatcctt taaaaaaaca- 900 
attcttcaac taaagcaccc attagttcaa 960 
ttaaaatata tatttatgtt acagtaatat 1020 
agaaagcaga caagtaagcc tcctaaattc 1080 
aatgaacttt aataaaattg atttagacaa 1140 
tttgaaccaa caaacgactt ttagtataac 1200 
aaacataaaa caagaaggat ataaatttta 1260 
gataaactca aatacagctt ttagaactgg 1320 
ggataagtta gagccacttt atacaatttt 1380 
ttggactcct gtaaagaatg acttcaaaga 14 40 
gaaatataat ggttcgggga aattgtttcc 1500 
tctttctatt attccatgga cttcatttac 1560 
taattacctt ctacccatta ttacagcagg 1620 
tttaccgcta tctttacagg tacatcattc ■ 1680 
tatgaactct attcaggaat tgtcagatag ' 1740 
aatgccgact gtacttttta cagtcggttt 1800 
aagaaggttt ttatattaca gctccggtac 1860 
gagtattcta tagtgtcacc taaatagctt 1920 
gtgaaattgt tatccgaaca ataggtgtac 1980 
tccatgcgat tgaagcatgc ggcgcggctg 2040 
acgaagttga cgggttgatt ttgccgggcg 2100 
atacgtatca attcatggag ccgcttcgtg 2160 
gaacatgtgc cggattaatt atattagcaa 2220 
taggtcttct gaatgtggtt gtagaacgta 2280 
aagctgattt aacaattaaa ggcttggacg 2340 
cgcatatttt agaagctggt gaaaatgttg 2400 
tagccgcgaa acaggggcaa ttccttggct 2460 
accgagtgac gcagctgttt gttgaaatgg 2520 
aaaacagttg aggatccggg atcactagtg 2580 
gctcccaacg cgttggatgc atagcttgag 2640 
gtaatcatgg tcatagctgt ttcctgtgtg 2700 
catacgagcc ggaagcataa agtgtaaagc 2760 
attaattgcg ttgcgctcac tgcccgcttt 2820 
ttaatgaatc ggccaacgcg cggggagagg 2880 
ctcgctcact gactcgctgc gctcggtcgt 2940 
aaaggcggta atacggttat ccacagaatc 3000 
aaaaggccag caaaaggcca ggaaccgtaa 3060 
gctccgcccc cctgacgagc atcacaaaaa 3120 
gacaggacta taaagatacc aggcgtttcc 3180 
tccgaccctg ccgcttaccg gatacctgtc 3240 
ttctcatagc tcacgctgta ggtatctcag 3300 
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ttcggtgtag gtcgttcgct ccaagctggg 
ccgctgcgcc ttatccggta actatcgtct 
gccactggca gcagccactg gtaacaggat 
agagttcttg aagtggtggc ctaactacgg 
cgctctgctg aagccagtta ccttcggaaa 
aaccaccgct ggtagcggtg gtttttttgt 
aggatctcaa gaagatcctt tgatcttttc 
ctcacgttaa gggattttgg tcatgagatt 
aaattaaaaa tgaagtttta aatcaatcta 
ttaccaatgc ttaatcagtg aggcacctat 
agttgcctga ctccccgtcg tgtagataac 
cagtgctgca atgataccgc gagacccacg 
ceagccagcc ggaagggccg agcgcagaag 
gtctattaat tgttgccggg aagctagagt 
cgttgttggc attgctacag gcatcgtggt 
cagctccggt tcccaacgat caaggcgagt 
ggttagctcc ttcggtcctc cgatcgttgt 
catggttatg gcagcactgc ataattctct 
tgtgactggt gagtactcaa ccaagtcatt 
ctcttgcccg gcgtcaatac gggataatag 
catcattgga aaacgttctt cggggcgaaa 
cagttcgatg taacccactc gtgcacccaa 
cgtttctggg tgagcaaaaa caggaaggca. 
acggaaatgt tgaatactca tactcttcct 
ttattgtctc atgagcggat acatatttga. 
.tccgcgcaca tttccccgaa aagtgccacc 
gtaaggagaa aataccgcat caggcgaaat 
gttaaatatt tgttaaatca gctcattttt 
ttataaatca aaagaataga ccgagatagg 
tccactatta aagaacgtgg actccaacgt 
tggcccacta cgtgaaccat cacccaaatc 
tctaaatcgg aaccctaaag ggagcccccg 
cgtggcgaga aaggaaggga agaaagcgaa 
agcggtcacg ctgcgcgtaa ccaccacacc 



ctgtgtgcac gaaccccccg ttcagcccga 3360 
tgagtccaac ccggtaagac acgacttatc 3420 
tagcagagcg aggtatgtag gcggtgctac 3480 
ctacactaga aggacagtat ttggtatctg 354 0 
aagagttggt agctcttgat ccggcaaaca 3600 
ttgcaagcag cagattacgc gcagaaaaaa 3660 
tacggggtct gacgctcagt ggaacgaaaa 3720 
atcaaaaagg atcttcacct agatcctttt 3780 
aagtatatat gagtaaactt ggtctgacag 3840 
ctcagcgatc tgtctatttc gttcatccat 3900 
tacgatacgg gagggcttac catctggccc 3960 
ctcaccggct ccagatttat cagcaataaa 4020 
tggtcctgca actttatccg cctccatcca 4 080 
aagtagttcg ccagttaata gtttgcgcaa 4140 
gtcacgctcg tcgtttggta tggcttcatt 4200 
tacatgatcc cccatgttgt gcaaaaaagc 4260 
cagaagtaag ttggccgcag tgttatcact 4320 
tactgtcatg ccatccgtaa gatgcttttc 4380 
ctgagaatac cgcgcccggc gaccgagttg 4 44 0 
tgtatgacat agcagaactt taaaagtgct 4500 
actctcaagg atcttaccgc tgttgagatc 4560 
ctgatcttca. gcatctttta ctttcaccag 4 620 
aaatgccgca aaaaagggaa taagggcgac 4 680 
ttttcaatat tattgaagca tttatcaggg 47 40 
atgtatttag aaaaataaac aaataggggt 4 800 
tgtatgcggt gtgaaatacc gcacagatgc 4860 
tgtaaacg.tt aatattttgt taaaattcgc 4 920 
taaccaatag gccgaaatcg gcaaaatccc 4 980 
gttgagtgtt gttccagttt ggaacaagag 5040 
caaagggcga aaaaccgtct atcagggcga 5100 
aagttttttg cggtcgaggt gccgtaaagc 5160 
atttagagct tgacggggaa agccggcgaa 5220 
aggagcgggc gctagggcgc tggcaagtgt 5280 
cgccgcgctt- aa 5322 



<210> 6 
<211> 5297 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid - pDXllF 
<400> 6 

tgcgccgcta cagggcgcgt ccattcgcca 
tcggtgcggg cctcttcgct attacgccag 
ttaagttggg taacgccagg gttttcccag 
ttgtaatacg actcactata gggcgaattg 
ggccgcggga tccctctaga ggaggagaca 
gtcctgcagc gctgccactg gaagttctgc 
acgaatccgg catgtctgtt atggagcttt 
accaaaaagc gaaaagcctc ttaatcgaac 
tgtttcttca aggcggggca agccttcaat 
ctgaaaaaac cgcacatttt gtgatgaccg 
cgaaactgtt cgggaacacg tctatcaccg 
ttccagaggt tgaccttacg gatgtaaaag 
atacaatttt cggcactcag tggcaggagt 
acatgtccag cgatatttta agcagaaaaa 
gaggcatcac tagtgcggcc gcctgcaggt 



ttcaggctgc gcaactgttg ggaagggcga 60 
ctggcgaaag ggggatgtgc tgcaaggcga 120 
tcacgacgtt gtaaaacgac ggccagtgaa 180 
ggcccgacgt cgcatgctcc cggccgccat 240 
taatggaacg tacaacgaat tttaacgcag 300 
aaaaagcaca gaaagaattt attgatttta 360 
cccaccgcag caaagagtat gaagcggtgc 420 
tgatgggcat tccggaagat tacgatatct .480 
tctcaatgct tccgatgaac tttttaacac 540 
gcgcttggtc tgaaaaagca ctggcagaaa 600 
ctacaagtga aacagacaat tacagttata 660 
acggcgcata tttacatatc acatccaaca 720 
ttccgaattc tccaattccg ctcgtagccg 780 
tcgatgtgtc caaatttgat gtgatctacg 840 
cgaccatatg ggagaggcgg ccgcgtcgac 900 
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caatagttac ccttattatc aagataagaa agaaaaggat ttttcgctac gctcaaatcc 960 
tttaaaaaaa cacaaaagac cacatttttt aatgtggtct ttattcttca actaaagcac 1020 
ccattagttc aacaaacgaa aattggataa agtgggatat ttttaaaata tatatttatg 1080 
ttacagtaat attgactttt aaaaaaggat tgattctaat gaagaaagca gacaagtaag 1140 
cctcctaaat tcactttaga taaaaattta ggaggcatat caaatgaact ttaataaaat 1200 
tgatttagac aattggaaga gaaaagagat atttaatcat tatttgaacc aacaaacgac 1260 
ttttagtata accacagaaa ttgatattag tgttttatac cgaaacataa aacaagaagg 1320 
atataaattt taccctgcat ttattttctt agtgacaagg gtgataaact caaatacagc 1380 
ttttagaact ggttacaata gcgacggaga gttaggttat tgggataagt tagagccact 1440 
ttatacaatt tttgatggtg tatctaaaac attctctggt atttggactc ctgtaaagaa 1500 
tgacttcaaa gagttttatg atttatacct ttctgatgta gagaaatata atggttcggg 1560 
gaaattgttt cccaaaacac ctatacctga aaatgctttt tctctttcta ttattccatg 1620 
gacttcattt actgggttta acttaaatat caataataat agtaattacc ttctacccat 1680 
tattacagca ggaaaattca ttaataaagg taattcaata tatttaccgc tatctttaca 1740 
ggtacatcat tctgtttgtg atggttatca tgcaggattg tttatgaact ctattcagga 1800 
attgtcagat aggcctaatg actggctttt ataatatgag ataatgccga ctgtactttt I860 
tacagtcggt tttctaatgt cactaacctg ccccgttagt tgaagaaggt ttttatatta 1920 
cagctccggt accgggctcc caacgcgttg gatgcatagc ttgagtattc tatagtgtca 1980 
cctaaatagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgta 2040 
gtcatcatga aaaaaagctg gctgcaaaat gaaaatgcga acgtcccaaa *aatcttgaaa 2100 
g% tattccacgc atgtcaaagc ggattcactc tacaacactc cgccgacatt tgcgatttat 2160 
« atgctgagcc tcgttctgga atggctcaag gaaaacggcg gtgtggaagc tgttgaacag 2220 
^ 3 cgcaatgaac aaaaagcgca ggttctctac agctgtattg atgaaagcaa cggcttctat 2280 
aaaggacatg ccagaaaaga cagccgctca cgcatgaatg tcacattcac gcttcgggat 2340 
gacgaattaa cgaaaacatt cgttcagaaa gcaaaagatg cgaagatgat cggccttggc 2400 
Nf ggacaccgtt cggtgggagg ctgccgcgct tctatttata acgcggtctc tctcgaagac 2460 
O tgtgaaaaat tagctgcgtt catgaagaaa ttccagcagg aaaatgagta aaaagtctgg 2520 
fg ctatgcattc cgtccgggat ccgggatcac tagtgcggcc gcctgcaggt cgaccatatg 2580 
ii ggagagctcc caacgcgttg gatgcatagc ttgagtattc tatagtgtca cctaaatagc 2640 
ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 2700 
L cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 2760 
R ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 2820 
W ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc 2880 
m gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 2940 
W cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 3000 
Q tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 3060 
f|J gataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 3120 
aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 3180 
cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 3240 
gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 3300 
ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 3360 
cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 3420 
aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 3480 
tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 3540 
ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 3600 
tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 3660 
ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 3720 
agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 3780 
atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 3840 
cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 3900 
ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 3960 
ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 4020 
agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 4080 
agagtaagta gttcgccagt taatagtttg cgcaacgttg ttggcattgc tacaggcatc 4140 
gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 4200 
cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 4260 
gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 4320 
tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 4380 
tcattctgag aataccgcgc ccggcgaccg agttgctctt gcccggcgtc aatacgggat 4440 
aatagtgtat gacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 4500 
cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 4560 
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cccaactgat 
aggcaaaatg 
ttcctttttc 
tttgaatgta 
ccacctgtat 
gaaattgtaa 
ttttttaacc 
atagggttga 
aacgtcaaag 
aaatcaagtt 
ccccgattta 
gcgaaaggag 
acacccgccg 



cttcagcatc 
ccgcaaaaaa 
aatattattg 
tttagaaaaa 
gcggtgtgaa 
acgttaatat 
aataggccga 
gtgttgttcc 
ggcgaaaaac 
ttttgcggtc 
gagcttgacg 
cgggcgctag 
cgcttaa 



ttttactttc 
gggaataagg 
aagcatttat 
taaacaaata 
ataccgcaca 
tttgttaaaa 
aatcggcaaa 
agtttggaac 
cgtctatcag 
gaggtgccgt 
gggaaagccg 
ggcgctggca 



accagcgttt 
gcgacacgga 
cagggttatt 
ggggttccgc 
gatgcgtaag 
ttcg'cgttaa 
atcccttata 
aagagtccac 
ggcgatggcc 
aaagctctaa 
gcgaacgtgg 
agtgtagcgg 



ctgggtgagc 
aatgttgaat 
gtctcatgag 
gcacatttcc 
gagaaaatac 
atatttgtta 
aatcaaaaga 
tattaaagaa 
cactacgtga 
atcggaaccc 
cgagaaagga 
tcacgctgcg 



aaaaacagga 
actcatactc 
cggatacata 
ccgaaaagtg 
cgcatcaggc 
aatcagctca 
atagaccgag 
cgtggactcc 
accatcaccc 
taaagggagc 
agggaagaaa 
cgtaaccacc 



4 620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5297 



<210> 7 
<211> 6731 
<212> DNA 

<213> Artificial Sequence 



m 

Q 

w 
m 
si 
o 
a 

b 
U 

m 
m 

Q 

m 



<220> 

<223> plasmicl 



pDX14R 



<400> 7 

ttgcggccgc 

gacattagaa 

tcattaggcc 

tcacaaacag 

atgaattttc 

ttaaacccag 

ggtgttttgg 

tcataaaact 

acaccatcaa 

ctattgtaac 

aatgcagggt 

atttctgtgg 

ctcttccaat 

atctaaagtg 

taaaagtcaa 

tttcgtttgt 

ggtcttttgt 

tgataataag 

tccgtcagga 

cgccaccctc 

ctcaggagag 

gcctttcgtt. 

accatcggcg 

gctactgccg 

tatcaggctg 

tgccttttgc 

tgtcagctcc 

accattatgc 

gatgaatacc 

gtcaacctgc 

attatctgaa 

cggtttgccc 

acggcgcatc 

ctccggacgt 

tctaactgct 

ttcttaccag 



ttcgaaagct 
aaccgactgt 
tatctgacaa 
aatgatgtac 
ctgctgtaat 
taaatgaagt 
gaaacaattt 
ctttgaagtc 
aaattgtata 
cagttctaaa 
aaaatttata 
ttatactaaa 
tgtctaaatc 
aatttaggag 
tattactgta 
tgaactaatg 
gtttttttaa 
ggtaactatt 
tggccttctg 
cgggccgttg 
cgttcaccga 
ttatttgatg 
ctacggcgtt 
ccaggcaaat 
aaaatcttct 
ttatattcct 
ggatggaatg 
tccgatagaa 
ccagtaaaag 
cggccaaatg 
ccggcaattt 
tgagcagcga 
gtcgtgctct 
tttacgacaa 
ccttgaagtc 
ccgcgttctt 



gtaatataaa 
aaaaagtaca 
ttcctgaata 
ctgtaaagat 
aatgggtaga 
ccatggaata 
ccccgaacca 
attctttaca 
aagtggctct 
agctgtattt 
tccttcttgt 
agtcgtttgt 
aattttatta 
gcttacttgt 
acataaatat 
ggtgctttag 
aggatttgag 
gaattcggta 
cttaatttga 
cttcgcaacg 
caaacaacag 
cctggcagtt 
tcacttctga 
tctgttttat 
ctcatccgcc 
caaccatttc 
agcagccaag 
cttcaacatt 
gctcgtccaa 
aattacgttc 
cttttgctaa 
attcacgaag 
caccgcccgg 
gaccagccgc 
ctagtacacc 
gcatacgctg 



aaccttcttc 
gtcggcatta 
gagttcataa 
agcggtaaat 
aggtaattac 
atagaaagag 
ttatatttct 
ggagtccaaa 
aacttatccc 
gagtttatca 
tttatgtttc 
tggttcaaat 
aagttcattt 
ctgctttctt 
atattttaaa 
ttgaagaata 
cgtagcgaaa 
ccaagagttt 
tgcctggcag 
ttcaaatccg 
ataaaacgaa 
ccctactctc 
gttcggcatg 
cagaccgctt 
aaaacaggat 
aacaaacagc 
gaattgcccc 
ttcaccagct 
gcctttaatt 
tacaaccaca 
tataattaat 
cggctccatg 
caaaatcaac 
gccgcatgct 
tattgttaac 
ttctggaagt 



aactaacggg 
tctcatatta 
acaatcctgc 
atattgaatt 
tattattatt 
aaaaagcatt 
ctacatcaga 
taccagagaa 
aataacctaa 
cccttgtcac 
ggtataaaac 
aatgattaaa 
gatatgcctc 
cattagaatc 
aatatcccac 
aagaccacat 
aatccttttc 
gtagaaacgc 
tttatggcgg 
ctcccggcgg 
aggcccagtc 
gcatggggag 
gggtcaggtg 
ctgcgttctg 
cctcaactgt 
tgcgtcactc 
tgtttcgcgg 
tctaaaatat 
gttaaatcag 
ttcagaagac 
ccggcacatg 
aattgatacg 
ccgtcaactt 
tcaatcgcat 
atgtcagcag 
aagtttgaga 



gcaggttagt 
taaaagccag 
atgataacca 
acctttatta 
gatatttaag 
ttcaggtata 
aaggtataaa 
tgttttagat 
ctctccgtcg 
taagaaaata 
actaatatca 
tatctctttt 
ctaaattttt 
aatccttttt 
tttatccaat 
taaaaaatgt 
tttcttatct 
aaaaaggcca 
gcgtcctgcc 
^tttgtccta 
tttcgactga 
accccacact 
ggaccaccgc 
atttaatctg 
tttatacaag 
ggtgatcttc 
ctacaatacg 
gcggagcacg 
cttcaaagct 
ctaaatgagg 
ttccaaacat 
tatcgatcaa 
cgttcagctg 
ggatgtgctc 
cgctcctatg 
tttcaatccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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01 

o 
w 

m 



JB 

b 
y 

FU 

fy 

ni 



tttcattgca 
agttgttgct 
agaaccaaca 
tgctacgccg 
aagaagctcg 
cgcaactact 
attacctgtt 
tgttgcttca 
taaatgaaat 
catagcttca 
gatagatact 
tccagccgcg 
ttcttcagcg 
catttctgcc 
tctagagcgt 
ctttttgcaa 
ctttccgctc 
gcagttggtt 
tttttacagt 
ttcggtcgac 
ccttccccat 
tgctgtccag 
ttaccagcct 
cgagcacatg 
ccgcgttgcg 
cctcgctaac 
gaatgcgcaa 
cacgcggcgc 
gtcgttgagg 
gatacgcgag 
atgaatggtc 
caccattatg 
atctgtatta 
ccataccgcc 
aacccgtatc 
ttccccctta 
cgctttatca 
gaacaggcag 
ctcgcgcgtt 
aeagcttgtc 
gttggcgggt 
ggcttaacta 
taccgcacag 
ctgactcgct 
taatacggtt 
agcaaaaggc 
cccctgacga 
tataaagata 
tgccgcttac 
gctcacgctg 
acgaaccccc 
acccggtaag 
cgaggtatgt 
gaaggacagt 
gtagctcttg 
agcagattac 
ctgacgctca 
ggatcttcac 
atgagtaaac 
tctgtctatt 
gggagggctt 



gtaccaagct 
tccacaattg 
aatactccgt 
ccagcggcaa 
taaggagcac 
ttgcgcactt 
ccaggctcac 
ccaagatcac 
tcttcgtcag 
agcacacgcg 
gcattcatta 
cgaatatctg 
attttcgctt 
attccgcgtt 
cctgctgttg 
aatttttcag 
ttgtgctgtt 
aaaaggaaac 
cacagccact 
ctgcaggcat 
tatgattctt 
gcaggtagat 
aacttcgatc 
gaacgggttg 
tcgcggtgca 
ggattcacca 
accaaccctt 
atctcgggca 
acccggctag 
cgaacgtgaa 
ttcggtttcc 
ttccggatct 
acgaagcgct 
agttgtttac 
gtgagcatcc 
cacggaggca 
gaagccagac 
acatctgtga 
tcggtgatga 
tgtaagcgga 
gtcggggcgc 
tgcggcatca 
atgcgtaagg 
gcgctcggtc 
atccacagaa 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatctggc 



cttttgacaa 
ctttcgcaaa 
cagcaccaag 
agttaacgac 
ctaggttttt 
gagcgttaac 
cttttgtgcg 
ggcagccaca 
ccggcgtcag 
cttcaacaat 
cttcttccac 
ctggcacacg 
gttccgcatt 
ttacacgttc 
ttaagattat 
gaattttagc 
aggatatctt 
aaaaagaccg 
tttgcaaaaa 
gcaagcttcg 
ctcgcttccg 
gacgaccatc 
actggaccgc 
gcatggattg 
tggagccggg 
ctccaagaat 
ggcagaacat 
gcgttgggtc 
gctggcg'ggg 
gcgactgctg 
gtgtttcgta 
gcatcgcagg 
ggcattgacc 
^cctcacaacg 
tctctcgttt 
tcaagtgacc 
attaacgctt 
atcgcttcac 
cggtgaaaac 
tgccgggagc 
agccatgacc 
gagcagattg 
agaaaatacc 
gttcggctgc 
tcaggggata 
aaaaaggccg 
aatcgacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat 
atagttgcct 
cccagtgctg 



ctcagcgatt 
tttagcaggg 
ctgcatcatg 
aggaagcttg 
cgcttctgtc 
tttacgcata 
aagcatagaa 
gacaaaagga 
aacttcactt 
atgtccgata 
gattgtaggg 
ttctagcgcc 
gatgacgtcc 
agtacctgtt 
tataccacac 
agaggttgtt 
tcttggaagc 
ttttcacaca 
ccggacagct 
cgaagcggcc 
gcggcatcgg 
agggacagct 
tgatcgtcac 
taggcgccgc 
ccacctcgac 
tggagccaat 
atccatcgcg 
ctggccacgg 
ttgccttact 
ctgcaaaacg 
aagtctggaa 
atgctgctgg 
ctgagtgatt 
ttccagtaac 
catcggtatc 
aaacaggaaa 
ctggagaaac 
gaccacgctg 
ctctgacaca 
agacaagccc 
cagtcacgta 
tactgagagt 
gcatcaggcg 
ggcgagcggt 
acgcaggaaa 
cgttgctggc 
caagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccccgt 
caatgatacc 



aatttgtaat 
ttgtctgatt 
agagcagcat 
ccgtcttttt 
attagctcat 
tggcgaacag 
gcaccttccg 
actgtgtatt 
tcatcaatat 
cgcgcttttg 
tcagccatac 
attacagcga 
atgatgacgc 
tgagccatgt 
cttgtagata 
ctggatgtag 
taggtaggcc 
aaacggtctt 
tcatgcctta 
gccgacgcga 
gatgcccgcg 
tcaaggatcg 
ggcgatttat 
cctatacctt 
ctgaatggaa 
caattcttgc 
tccgccatct 
gtgcgcatga 
ggttagcaga 
tctgcgacct 
acgcggaagt 
ctaccctgtg 
tttctctggt 
cgggcatgtt 
attaccccca 
aaaccgccct 
tcaacgagct 
atgagcttta 
tgcagctccc 
gtcagggcgc 
gcgatagcgg 
gcaccatatg 
ctcttccgct 
atcagctcac 
gaacatgtga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taactatcgt 
tggtaacagg 
gcctaactac 
taccttcgga 
tggttttttt 
tttgatcttt 
ggtcatgaga 
taaatcaatc 
tgaggcacct 
cgtgtagata 
gcgagaccca 



cagtaaagtg 
taaaaatacc 
cagctggagt 
taatttgaag 
■cctcactcat 
cctcaacaat 
caatacggcg 
catttttatt 
agtcaacacc 
ccattaccgg 
gggcaactcc 
cagctccagc 
cgcctttttg 
tttctcctcc 
aagtcaacaa 
aacaaaacat 
tcgagttatg 
tttcgatttc 
taactgctgt 
ggctggatgg 
ttgcaggcca 
ctcgcggctc 
gccgcctcgg 
gtctgcctcc 
gccggcggca 
ggagaactgt 
ccagcagccg 
tcgtgctcct 
atgaatcacc 
gagcaacaac 
cagcgccctg 
gaacacctac 
cccgccgcat 
catcatcagt 
tgaacagaaa 
taacatggcc 
ggacgcggat 
ccgcagctgc 
ggagacggtc 
gtcagcgggt 
agtgtatact 
cggtgtgaaa 
tcctcgctca 
tcaaaggcgg 
gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcata 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4 920 
4980 
5040 
5100 
,5160 
5220 
5280 
5340 
5400 
54 60 
5520 
5580 
5640 
5700 
5760 
5820 



Atty. Docket No.: BGI- 152-1 



7 



ctccagattt atcagcaata aaccagccag 
caactttatc cgcctccatc cagtctatta 
cgccagttaa tagtttgcgc aacgttgttg 
cgtcgtttgg tatggcttca ttcagctccg 
cccccatgtt gtgcaaaaaa gcggttagct 
agttggccgc agtgttatca ctcatggtta 
tgccatccgt aagatgcttt tctgtgactg 
agtgtatgcg gcgaccgagt tgctcttgcc 
atagcagaac tttaaaagtg ctcatcattg 
ggatcttacc gctgttgaga tccagttcga 
cagcatcttt tactttcacc agcgtttctg 
caaaaaaggg aataagggcg acacggaaat 
attattgaag catttatcag ggttattgtc 
agaaaaataa acaaataggg gttccgcgca 
aagaaaccat tattatcatg acattaacct 
gtcttcaaga a 



ccggaagggc cgagcgcaga agtggtcctg 5880 
attgttgccg ggaagctaga gtaagtagtt 5940 
ccattgctgc aggcatcgtg gtgtcacgct 6000 
gttcccaacg atcaaggcga gttacatgat 6060 
ccttcggtcc tccgatcgtt gtcagaagta 6120 
tggcagcact gcataattct cttactgtca 6180 
gtgagtactc aaccaagtca ttctgagaat 6240 
cggcgtcaat acgggataat accgcgccac 6300 
gaaaacgttc ttcggggcga aaactctcaa 6360 
tgtaacccac tcgtgcaccc aactgatctt 6420 
ggtgagcaaa aacaggaagg caaaatgccg 6480 
gttgaatact catactcttc ctttttcaat 6540 
tcatgagcgg atacatattt gaatgtattt 6600 
catttccccg aaaagtgcca cctgacgtct 6660 
ataaaaatag gcgtatcacg aggccctttc 6720 

6731 



<210> 8 
<211> 10181 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid - pDX17R 



<400> 8 

gaattttgcg 

ttagtgacat 

gccagtcatt 

aaccatcaca 

tattaatgaa 

ttaagttaaa 

gtataggtgt 

ataaatcata 

tagatacacc 

cgtcgctatt 

aaataaatgc 

tatcaatttc 

cttttctctt 

tttttatcta 

ttttttaaaa 

ccaattttcg 

aatgtggtct 

tatcttgata 

ggccatccgt 

ctgcccgcca 

tcctactcag 

actgagcctt 

acactaccat 

accgcgctac 

atctgtatca 

acaagtgcct 

tcttctgtca 

atacgaccat 

gcacggatga 

aagctgtcaa 

tgaggattat 

aacatcggtt 

atcaaacggc 



gccgcttcga 

tagaaaaccg 

aggcctatct 

aacagaatga 

ttttcctgct 

cccagtaaat 

tttgggaaac 

aaactctttg 

atcaaaaatt 

gtaaccagtt 

agggtaaaat 

tgtggttata 

ccaattgtct 

aagtgaattt 

gtcaatatta- 

tttgttgaac 

tttgtgtttt 

ataagggtaa 

caggatggcc 

ccctccgggc 

gagagcgttc 

tcgttttatt 

cggcgctacg 

tgccgccagg 

ggctgaaaat 

tttgcttata 

gctccggatg 

tatgctccga 

ataccccagt 

cctgccggcc 

ctgaaccggc 

tgccctgagc 

gcatcgtcgt 



aagctgtaat 
actgtaaaaa 
gacaattcct 
tgtacctgta 
gtaataatgg 
gaagtccatg 
aatttccccg 
aagtcattct 
gtataaagtg 
ctaaaagctg 
ttatatcctt 
ctaaaagtcg 
aaatcaattt 
aggaggctta 
ctgtaacata 
taatgggtgc 
tttaaaggat 
ctattgaatt 
ttctgcttaa 
cgttgcttcg 
accgacaaac 
tgatgcctgg 
gcgtttcact 
caaattctgt 
cttctctcat 
ttcctcaacc 
gaatgagcag 
tagaacttca 
aaaaggctcg 
aaatgaatta 
aatttctttt 
agcgaattca 
gctctcaccg 



ataaaaacct 
gtacagtcgg 
gaatagagtt 
aagatagcgg 
gtagaaggta 
gaataataga 
aaccattata 
ttacaggagt 
gctctaactt 
tatttgagtt 
cttgttttat 
tttgttggtt 
tattaaagtt 
cttgtctgct 
aatatatatt 
tttagttgaa 
ttgagcgtag 
cggtaccaag 
tttgatgcct 
caacgttcaa 
aacagataaa 
cagttcccta 
tctgagttcg 
tttatcagac 
ccgccaaaac 
atttcaacaa 
ccaaggaatt 
acattttcac 
tccaagcctt 
cgttctacaa 
gctaatataa 
cgaagcggct 
cccggcaaaa 



tcttcaacta 
cattatctca 
cataaacaat 
taaatatatt 
attactatta 
aagagaaaaa 
tttctctaca 
ccaaatacca 
atcccaataa 
tatcaccctt 
gtttcggtat 
caaataatga 
catttgatat 
ttcttcatta 
ttaaaaatat 
gaataaagac 
cgaaaaatcc 
agtttgtaga 
ggcagtttat 
atccgctccc 
acgaaaggcc 
ctctcgcatg 
gcatggggtc 
cgcttctgcg 
aggatcctca 
acagctgcgt 
gcccctgttt 
cagcttctaa 
taattgttaa 
ccacattcag 
ttaatccggc 
ccatgaattg 
tcaacccgtc 



acggggcagg 
tattataaaa 
cctgcatgat 
gaattacctt 
ttattgatat 
gcattttcag 
tcagaaaggt 
gagaatgttt 
cctaactctc 
gtcactaaga 
aaaacactaa 
ttaaatatct 
gcctcctaaa 
gaatcaatcc 
cccactttat 
cacattaaaa 
ttttctttct 
aacgcaaaaa 
ggcgggcgtc 
ggcggatttg 
cagtctttcg 
gggagacccc 
aggtgggacc 
ttctgattta 
actgttttat 
cactcggtga 
cgcggctaca 
aatatgcgga 
atcagcttca 
aagacctaaa 
acatgttcca 
atacgtatcg 
aacttcgttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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agctgctccg gacgttttac gacaagacca 
tgctctctaa ctgctccttg aagtcctagt 
ctatgttctt accagccgcg ttcttgcata 
atccctttca ttgcagtacc aagctctttt 
aagtgagttg ttgcttccac aattgctttc 
ataccagaac caacaaatac tccgtcagca 
ggagttgcta cgccgccagc ggcaaagtta 
tgaagaagaa gctcgtaagg agcacctagg 
ctcatcgcaa ctactttgcg cacttgagcg 
acaatattac ctgttccagg ctcacctttt 
cggcgtgttg cttcaccaag atcacggcag 
ttatttaaat gaaattcttc gtcagccggc 
acacccatag cttcaagcac acgcgcttca 
accgggatag atactgcatt cattacttct 
actcctccag ccgcgcgaat atctgctggc 
ccagcttctt cagcgatttt cgcttgttcc 
ttttgcattt ctgccattcc gcgttttaca 
cctcctctag aacaggcggg gttgcccccg 
ggaaagtcaa tacctttttg taaaattttt 
ctttgtaccc ctcaagggag aaatattggc 
gtgaaccata gctgtcgtca atagcctcga 
agaccgtttt cacacaaaac ggtctttttc 
caaaaaccgg acagcttcat gccttataac 
ttcttctgca agccaaaaaa ccttccgtta 
tcggcgagtt tcatccctct gtcccagtcc 
tctatctatt ttaataatag gtgcagttcg 
aaatcaacgg gcttgtacca acacattagc 
taatgccttc gttcgtttct aaaagcagaa 
cgtaaaaggt tccgtttaac gttctggctc 
tttcccataa aagcttaatc ggcgtaaatc 
caaagcatag taaaatatgc tggatgacgc 
ggctgaggct tgtcgcgatg tccttcaatt 
cgttaatgcc gatcccaatg atcactgaac 
ttaggatacc gactgttttt tttccgttaa 
tttggatgcc tgctgcctct tctattccct 
gctgcggtgt tttttggagc ggaatgtcag 
ttccttcttg agaatgccat accctagaca 
ccaccacaag ggtgccttcc ggtgcgttat 
tgcttgaaag aacgtcatgg taaataagat 
aacgaatttc gctttcactg agttttccgg 
taacggcttc tacttcataa ccctctttcc 
ttcttgaaca gccgagagca tcactgattt 
gagaaaataa ttcaataagg tcttttctta 
tatgtgtttc ttttgattgg agagcttccc 
ttcttctgac acccattttc cggccggccg 
gatatcaaga tccttaaggc ttttgatcgg 
cagtttcttt tcatccagtt tttcgttttg 
cagtgctttt ttcccagctc tgtacattgt 
ggcaatgtga atggcttcct tgatcacttt 
gggcgcgtct ttcaaaacta tgccaaggtt 
ggatgttaaa gagaagaatg gaaactcact 
aaaaccagga agctcttcat acaatctcgt 
agctcttccc tgcagcaatt tctcaaactc 
gaggagtgat ttttctttcg caatggcttc 
ctggctcata aagcgtacgg ctctcagcat 
aggctttcca acggttcgaa tcactttctg 
aagcaccttc ccgtccgctg tcatggccat 
atcctcttct aatgatgaga taaattgcac 
agattcagtc cggaatgtcg tgacttcata 
cccgtgctct ttgcctacat caacagtccg 
cgcatctgtc gcgatatcga catctccgat 



gccgcgccgc atgcttcaat cgcatggatg 2040 
acacctattg ttaacatgtc agcagcgctc 2100 
cgctgttctg gaagtaagtt tgagatttca 2160 
gacaactcag cgattaattt gtaatcagta 2220 
gcaaatttag cagggttgtc tgatttaaaa 2280 
ccaagctgca tcatgagagc agcatcagct 2340 
acgacaggaa gcttgccgtc ttttttaatt 24 00 
tttttcgctt ctgtcattag ctcatcctca 24 60 
ttaactttac gcatatggcg aacagcctca 2520 
gtgcgaagca tagaagcacc ttccgcaata 2580 
ccacagacaa aaggaactgt gtattcattt 2640 
gtcagaactt cactttcatc aatatagtca 2700 
acaatatgtc cgatacgcgc ttttgccatt 2760 
tccacgattg tagggtcagc catacgggca 2820 
acacgttcta gcgccattac agcgacagct 2880 
gcattgatga cgtccatgat gacgccgcct 2940 
cgttcagtac ctgtttgagc catgttttct 3000 
cctgtaatta aattattaca- caccctgtag 3060 
acacagcgtg gatctcttct agggacacct 3120 
ggtactgagc acagttttgg ttggtggaca 3180 
gttatggcag ttggttaaaa ggaaacaaaa 3240 
gatttctttt tacagtcaca gccacttttg 3300 
tgctgtttcg gtcgacgatg atctgccgtt 3360 
caacgagaag gattcttcac tttctaaagt 3420 
ttttttggat caaggcagac tgctgcaatg 3480 
caggcgatac tgcccaatgg aagtatacca 3540 
ccaattcgat atcggcagaa tagatttttt 3600 
cgccttcatc atctatacct aacgccttac 3660 
tcatattagt gccaataccg agcgcatagc 3720 
cgtgcgtcat ataatcccgg taccgtttct 3780 
cggcccgatc aattttttcc ccagcagctt 3840 
catctggaaa atcattaggc tgctggttaa 3900 
gtacgcggtc ttcttcagcc tgcatttccg 3960 
tcaaaatatc atttggccat ttaatatccg 4020 
gcacaacagc tactgcagca agcagagtca 4080 
gccgcaaaat caggctcatc caaacaccgt 4140 
ttcggcccct tccggctgtt tgtttgtcag 4200 
tattcgcgag ctcatgagcc gttttttgcg 4260 
gctggcccat cacttccgtt tttaatccaa 4320 
gttttttgat gagccgatat ccttttcttc 4380 
gaagctcttc aatatgcttc cacacagcag 4 440 
tttggccgga aataaattca ttgccggcct 4500 
atgttgaccg catgtcttca gccactcctc 4560 
tgtcacaaca gcctgctcga tccactgtaa 4 620 
gtttcgaagc gcaagcaagt ccttacccgt 4 680 
caggttttga taagcgtact gaatgtcctt 4740 
ccgaagctgc gatattttgg ccgctgagag 4800 
cattgcgtca aggctctggc caaacgtatc 4860 
tcccgggagc ttccaggctt tcaggaaaag 4920 
aattaaaaga gcagcccaaa gctcctcacg 4980 
cgttgaaatc aggttttctc gtttatgata 5040 
ttgaatgagt gtttgaagcg cctggcgaga 5100 
tatagttttt cgttcgactg aaacatggga 5160 
ttctgtttcc ggtgaaagcg taaagccaag 5220 
acgaagcgca tcctcttgaa atctatcctc 5280 
atcaatatct ttcttgccgc caaaataatc 5340 
cgcattgatc gttaaatctc tgcgttttag 5400 
ttctgacggt cttctgaaat caacataatc 54 60 
ggtttcatcc tcccagagca caataatggt 5520 
ctgaaacagc cgttctactt gatcaggtgc 5580 
cgttcgtttc atatagctgt cacgaactgc 5640 
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gcccccgaca aaataagcct gatggcccgc 
tttgataaaa actttttcca tgtgatcact 
catactggct gacaattttt ttagaagaaa 
ttgtaaaacg attgcttagc tgttcatctt 
ccgtaacatc accgacatcc accaaaaatc 
taccgccaat gtttgttcca atacaaggca 
ggccaaagct ttctttttca gatagcagca 
caacacggtc ttgatttcca agcattaaga 
gctcgcaggc tgtcgatttc tccggaccgt 
tgccagcgat attgcggaac acacggatga 
ggattagcag agcgaggtat gtaggcggtg 
acggctacac tagaaggaca gtatttggta 
gaaaaagagt tggtagctct tgatccggca 
ttgtttgcaa gcagcagatt acgcgcagaa 
tttctacggg gtctgacgct cagtggaacg 
gattatcaaa aaggatcttc acctagatcc 
tctaaagtat atatgagtaa acttggtctg 
ctatctcagc gatctgtcta tttcgttcat 
taactacgat acgggagggc ttaccatctg 
cacgctcacc ggctccagat ttatcagcaa 
gaagtggtcc tgcaacttta tccgcctcca 
gagtaagtag ttcgccagtt aatagtttgc 
tggtgtcacg ctcgtcgttt ggtat-ggctt 
gagttacatg atcccccatg ttgtgcaaaa 
ttgtcagaag taagttggcc gcagtgttat 
ctcttactgt catgccatcc gtaagatgct 
cattctgaga atagtgtatg cggcgaccga 
ataccgcgcc acatagcaga actttaaaag 
gaaaactctc aaggatctta ccgctgttga 
ccaactgatc ttcagcatct tttactttca 
ggcaaaatgc cgcaaaaaag ggaataaggg 
tcctttttca atattattga agcatttatc 
ttgaatgtat ttagaaaaat aaacaaatag 
cacctgacgt ctaagaaacc attattatca 
cgaggccctt tcgtctcgca tgcggatcag 
ggatttcgat cacggcacga tcatcgtgcg 
gatgttaccc gagagcttgg cacccagcct 
accttttgaa tgacctttaa tagattatat 
cttttttatt ttaaaaatt't tttcacaaaa 
cgcaaacggg ctgttctggt gttgctagtt 
tttgccggct gaaagcgcta tttcttccag 
gtcactggct cccgtgttgt cggcagcttt 
ctgtctatgt gtgactgttg agctgtaaca 
agttgctttg ttttactggt ttcacctgtt 
acattgtcga tctgttcatg gtgaacagct 
gatgtatcta tcttttttac accgtt.ttca 
atctaacggt gaacagttgt tctacttttg 
acaagagcca taagaacctc agatccttcc 
tcgttgtttt tgcgtgagcc atgagaacga 
actcaaaaat tttgcctcaa aactggtgag 
tgtttttctt agtccgttac gtaggtagga 
accattcatt tttatct'ggt tgttctcaag 
ttcaacttgg aaaatcaacg tatcagtcgg 
attgctgtaa gtgtttaaat ctttacttat 
aaactcatgg tagttatttt caagcattaa 
tatatttgcc ttgtgagttt tcttttgtgt 
catagagtat ttgttttcaa ' aagacttaac 
taactggaaa agataaggca atatctcttc 
gaacttggca tagtttgtcc actggaaaat 
ttccacagtt ctcgtcatca gctctctggt 
cctactgatg ttcatcatct gagcgtattg 



ttcgattaag atgcggagca cgggaagtgc 5700 
ccggttctgc taaatcggca taaatctgtt 5760 
attcattttc aagcatctct attgccgcct 5820 
ctaaaatgct catcgcgcgg gctgttgcgg 5880 
cgctcacatt gttttttata acctcaggga 5940 
ctccgcaagc catcgcttca agcaggacaa 6000 
gcttcaaatc gctaatagaa taaagatctt 6060 
cttggtcttc caagccatat tttctgataa 6120 
ctccgactaa aagcagcttc gctttcgttt 6180 
catcctgcac gcgtgcagcc actggtaaca 6240 
ctacagagtt cttgaagtgg tggcctaact 6300 
tctgcgctct gctgaagcca gttaccttcg 6360 
aacaaaccac cgctggtagc ggtggttttt 6420 
aaaaaggatc tcaagaagat cctttgatct 6480 
aaaactcacg ttaagggatt ttggtcatga 6540 
ttttaaatta aaaatgaagt tttaaatcaa 6600 
acagttacca atgcttaatc agtgaggcac 6660 
ccatagttgc ctgactcccc gtcgtgtaga 6720 
gccccagtgc tgcaatgata ccgcgagacc 6780 
taaaccagcc agccggaagg gccgagcgca 6840 
tccagtctat taattgttgc cgggaagcta 6900 
gcaacgttgt tgccattgct acaggcatcg 6960 
cattcagctc cggttcccaa cgatcaaggc 7020 
aagcggttag ctccttcggt cctccgatcg 7080 
cactcatggt tatggcagca ctgcataatt 7140 
tttctgtgac tggtgagtac tcaaccaagt 7200 
gttgctcttg cccggcgtca atacgggata 7260 
tgctcatcat tggaaaacgt tcttcggggc 7320 
gatccagttc gatgtaaccc actcgtgcac 7380 
ccagcgtttc tgggtgagca aaaacaggaa 74 40 
cgacacggaa atgttgaata ctcatactct 7500 
agggttattg tctcatgagc ggatacatat 7560 
gggttccgcg cacatttccc cgaaaagtgc 7620 
tgacattaac ctataaaaat aggcgtatca 7680 
tgagggtttg caactgcggg tcaaggatct 7740 
ggagggcaag ggctccaagg atcgggcctt 7800 
gcgcgagcag gggaattgat ccggtggatg 7860 
tactaattaa ttggggaccc tagaggtccc 7920 
cggtttacaa gcataacggg ttttgctgcc 7980 
tgttatcaga atcgcagatc cggcttcagg 8040 
aattgccatg attttttccc cacgggaggc 8100 
gattcgataa gcagcatcgc ctgtttcagg 8160 
agttgtctca ggtgttcaat ttcatgttct 8220 
ctattaggtg ttacatgctg ttcatctgtt 8280 
ttaaatgcac caaaaactcg taaaagctct 8340 
tctgtgcata tggacagttt tccctttgat 8400 
tttgttagtc ttgatgcttc actgatagat 84 60 
gtatttagcc agtatgttct ctagtgtggt 8520 
accattgaga tcatgcttac tttgcatgtc 8580 
ctgaattttt gcagttaaag catcgtgtag 8640 
atctgatgta atggttgttg gtattttgtc 8700 
ttcggttacg agatccattt gtctatctag 8760 
gcggcctcgc ttatcaacca ccaatttcat 8820 
tggtttcaaa acccattggt taagcctttt 8880 
catgaactta aattcatcaa ggctaatctc 8940 
tagttctttt aataaccact cataaatcct 9000 
atgttccaga ttatatttta tgaatttttt 9060 
actaaaaact aattctaatt tttcgcttga 9120 
ctcaaagcct ttaaccaaag gattcctgat 9180 
tgctttagct aatacaccat aagcattttc 9240 
gttataagtg aacgataccg tccgttcttt 9300 
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ccttgtaggg 
ttcatgctcc 
ttcagacata 
tagtcaatga 
gacctttgct 
gtgtgttttt 
aaagataaaa 
gtattacaaa 
taaaggctta 
cgaccatcag 
ctctggcagt 
aactacccat 
tgctatgtgg 
tctgaccact* 
gcagcggtat 



ttttcaatcg 
gttaagtcat 
catctcaatt 
taattactag 
ggaaaacttg 
tttgtttata 
agaatagatc 
aggatgtcgc 
agtagcaccc 
gcacctgagt 
gaatgggggt 
aatacaagaa 
tgctatctga 
tcggattatc 
catcaacagg 



tggggttgag 
agcgactaat 
ggtctaggtg 
tccttttcct 
taaattctgc 
ttcaagtggt 
ccagccctgt 
aaacgctgtt 
tcgcaagctc 
cgctgtcttt 
aaatggcact 
aagcccgtca 
ctttttgctg 
ccgtgacagg 
cttacccgtc 



tagtgccaca 
cgctagttca 
attttaatca 
ttgagttgtg 
tagaccctct 
tataatttat 
gtataactca 
tgctcctc.ta 
gggcaaatcg 
ttcgtgacat 
acaggcgcct 
cgggcttctc 
ttcagcagtt 
tcattcagac 
ttactgtcaa 



cagcataaaa 
tttgctttga 
ctataccaat 
ggtatctgta 
gtaaattccg 
agaataaaga 
ctactttagt 
caaaacagac 
ctgaatattc 
tcagttcgct 
tttatggatt 
agggcgtttt 
cctgccctct 
tggctaatgc 



ttagcttggt 
aaacaactaa 
tgagatgggc 
aattctgcta 
ctagaccttt 
aagaataaaa 
cagttccgca 
cttaaaaccc 
cttttgtctc 
gcgctcacgg 
catgcaagga 
atggcgggtc 
gattttccag 
acccagtaag 



9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10181 



<210> 9 

<211> 194 

<212> DNA 
iTs <213> Artificial Sequence 

□ 

UJ <220> 

<223> promoter sequence 

f~j <400> 9 

^ gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 

tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 
ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 
^_ gcaaccccgc ctgt 194 

□ 

y <2io> io 

HI <211> 163 

<212> DNA 

<213> Artificial Sequence 



*0 



ra 
o 
m 



<220> 

<223> promoter sequence 
<400> 10 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 

<210> 11 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> promoter sequence 



<400> 11 

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60 
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
tgcaatg 127 



<210> 12 
<211> 42 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 12 

gaaatcatat aactatacct tgattagggg gaccaagaaa tg 

<210> 13 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 13 

caagaacgcg gctggtaaga acataggagc gctgctgaca tg 

<210> 14 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 14 

tctagaaagg aggtga 

<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 15 

tctagaagga ggagaaaaca tg 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding, site 
<400> 16 

tctagaggag gagaaaacat g 

<210> 17 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 17 
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taagaacaaa ggaggagagc tgacatg 

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 18 

taagaacaga ggaggagagc tgacatg 

<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 19 

taagaagaaa ggaggtgagc tgacatg 



